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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4892: 
CTCTTAACCT TGCAGCACCG GGCAGGCGTC AGCCCGTATA CATCACCTTA CGGTTTGCAG 
AGACCGTGTG TTTTTGAGAA ACAGGTGC 
(2) INFORMATION FOR SEQ ID NO: 4 893: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4893: 
TAGCGAATGA GATAAACGCG GCGGACTGTA AATCCGCTCC TTCGGGTTCG GCAGTTCGAA 
TCTGCCCCCC TCCATTTATT ATTTTnAAAA AAAGCATAGT TC 
(2) INFORMATION FOR SEQ ID NO: 4894: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4894: 
CGAGCGATTC CGACTCAGAC AATGACTCGG AATTCAGATA ACGAATCTGA CTCAGACAGT 
GACTCAGATT CCGACAGT 

(2) INFORMATION FOR SEQ ID NO: 4895: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 895: 
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TTAGTATTCG CACT CACATG TCACATGCTT CACTCGACCT ATTACCTCAC ACTTGGGGAC 120 
TTATACCGAG TnGGAATCTC A 141 
(2) INFORMATION FOR SEQ ID NO: 4896: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4896: 
GTTCGGATTT AATTTGATTC ATTTGTTGCG TAATTTCAGA AGC CATTTTA TGAAAAGAGT 60 
GATTTAATTC ATAAATTTCT 80 
(2) INFORMATION FOR SEQ ID NO: 4897: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4897: 
GGTATGGCTG GCGCTTTACG TGCCACATTA GATTATGTCA CTGAGCGTAA GAAGTTCGGC 60 
AAACCAATTA GTA 73 
(2) INFORMATION FOR SEQ ID NO: 4 8 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4898: 
GTGCCGAGGA CCGGAATCGA ACCGGTACGG TGATCACTCA CCGCAGGATT TTAAGTCCTG 60 
TGCGTCTGCC A 71 
(2) INFORMATION FOR SEQ ID NO : 4899: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4899: 
AGCCGCTGGA TTCCACGGTG TAAGCAGTAA GGCTGAGTAT TAGGCAAATC CGGTACTCGT 
TAAGGCTGGA GCTGTnGATG GGGAGAAGAC ATGTGTGCTT ACGTAGTGCG TGGTTT 
(2) INFORMATION FOR SEQ ID NO: 4900: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 113 base pairs 

(B) TYPE ; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4900: 
TTCCGGCCCC TGGCGGTnTC CTAGGAGTTC CACGGAACCC CAGGATTTTG TAGGACCCCC 
CGGGCAGAGT CCCCTTCGAC TAAGAAAATC TCACATTCTT CAGGGACTTT TAC 
(2) INFORMATION FOR SEQ ID NO: 4 901: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4901: 
ACTTAATGGA TTGATTAAGT AGTGGGTTCT TAACATTAGG CCTCAGCTAA TGTGGTACTT 
AAAAATAGGG AATACATG 

(2) INFORMATION FOR SEQ ID NO: 4 902: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4902: 
ATGGGACATG GTGTGCCTCC TTG CTATAGT CACCAGACAT ATGAATGTAA TTTATA CATT 
CAAAACTAGA TAGTAAGTAA AAGT 
(2) INFORMATION FOR SEQ ID NO: 4903: 
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(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4903: 
TTGTCGGGTA AGTTCCGACC CGCACGAAAG GCGTAACGAT TTTGGCACTG TCTCAACGAG 6 0 

AGACTCGGTG 7 0 

(2) INFORMATION FOR SEQ ID NO: 4904: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4904: 

TAAGTTGGCT ACCATCGACG CTAAGAACCT TTCTTGACTT GTGACAATCG CTTGCTTCTT 6 0 

TCCTCTTCTT GGCTCTGCTT CTATTTGCCT CTAATGTGGC TTTTTGTTGT AATAAG TTAT 12 0 

GCGCTTCGGT GTTATnGTGA AAAAGCGGTC A 151 
(2) INFORMATION FOR SEQ ID NO: 4905: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4905: 
TGACTTGTAG GCGCTGGTGG GATACTACCC TAGCTGTGTT GGCTTTCTAA CCCGCACCAC 60 
TTATCGTGGT GGGGA 75 
(2) INFORMATION FOR SEQ ID NO: 4906: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 112 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 9 06: 
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ACTCAGACAG TGACTCAGAT CAGATAGTGn CTCGGATTCA GCGATTATTC AG 112 
(2) INFORMATION FOR SEQ ID NO: 4 907: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4907: 
GTAAAAACAC CCCAGCCAGG TCACTTTACG CCTACGCATC GCTTGTACAC GTGCTACTAA 60 
AGGTTTACCA 70 
(2) INFORMATION FOR SEQ ID NO: 4908: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 908: 
AGAAGATACA AATAAAGnTA AACCCAAATT ATTCAATTTC GGTGGGACAC AATAGTGTTG 60 
ACTTTGAAGA AGATACACTT TCCCACCAAG TTAAGTGGGT CCATAATTGA AGGGTCCAAC 12 0 

C 121 
(2) INFORMATION FOR SEQ ID NO: 4909: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4909: 

GGGAACAAAG TGGACAGGTG GTGCATGGTT GTCGTCAACT CGTGTCGTGA GATGTTGGGG 60 

TTAAGTCCCG 7 ° 
(2) INFORMATION FOR SEQ ID NO: 4910: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4910: 
GGACACCCGG AGAACTGAAA CATTTAGTAC CCGGAGAAGA GAAG AAA TTC GTTCCCTTAG 6 0 

TAGCGGGACG AACGGGAGAG CCCAAACCAC AGCTTGTGnG 100 
(2) INFORMATION FOR SEQ ID NO: 4911: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4911: 
nTCTGTGTTC GGCATGGGAA CAGGTGTGAC CTCCTTGCTA TAGTCACCAG ACAAAATAAC 60 
TTACTTATTC ATGATACACG CTTAATTAAT AAGGTGCAAT AGTTAATTTT ACACTTTGAT 120 
GTAAACTTTG GCACATCAGC TTTTAGTGTG TTTCG 155 
(2) INFORMATION FOR SEQ ID NO: 4912: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 912: 

CCCGTCACAC CACGAGATTT GTAACACCCG AAAGCCGGTG GAGTGAACCT TTTAGGAGCT 60 

AGCCGTCGA 6 9 
(2) INFORMATION FOR SEQ ID NO: 4913: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 913: 
TATGCTCTCC CAGCTGAGCT AATTCTCCGA TTTAAAACTG CCTGGCAACG TTCTACTCTA 6 0 

GCGGAACGT 6 9 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4914: 
CAAATGCTTC TGTCAATTCG ATCAAATCAA TGTCT TCAAC AGATAGATTG CTGAGTGACA 
ATACTTCAT 

(2) INFORMATION FOR SEQ ID NO: 4915: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4 915: 
TATTTCGGAG AGAACCAGCT ATTTCCAGGC TCGATTGGAA TTTCTCCGGT ACCCTCAGTT 
CATCCGCTCA 

(2) INFORMATION FOR SEQ ID NO: 4916: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4916: 
TCACCCATTA ACGGGCTCTA ACTACTTGTA AG CACACGGT TTCAGGTTCT ATTTCACTCC 
CCTTCCG 

(2) INFORMATION FOR SEQ ID NO: 4917: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4917: 
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TCCAC CGTTG ACTAAGGTTC CAGAGAGGCT CGTCCCTCTG GGTAGTCGGG CCTAAGCTGA 120 
GGCGCACGTA GGCGATGATA CAGGTTATAT CCTnACACCT A 161 
(2) INFORMATION FOR SEQ ID NO: 4 918: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4918: 
GCACGTAGTT AGCCGTTGGC TTTCTGATTA GGTACCGGCA AGATGTGCAC AGTTACTTAC 6 0 

ACATATGT 68 
(2) INFORMATION FOR SEQ ID NO: 4 919: 

(i) SEQUENCE CHARACTERISTICS .- 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4919: 
G AAAG CAAAT GTCTTCGTTG CAATTCGATC AAATCAATGT CTTCAACAGA TAGATTGCTG 60 
AGTGACAATA CTTCAGGG 78 
(2) INFORMATION FOR SEQ ID NO: 4920: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4920: 
AATTTCGCTA CCTTAGGGAC CGTTATAGTT ACGGC CGACG TTTACTGGGG CTTCGATTCG 60 
TACTTCGC 6 8 

(2) INFORMATION FOR SEQ ID NO: 4921: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4921: 
AATCATAGTA CCTGTAAAGA TGCAGGTTAC CCGCAACAAG ACGGAAAGAC CCCGTGGAGC 
TTTACTG 

(2) INFORMATION FOR SEQ ID NO: 4922: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 922: 
GGGAGCTGTA ATAAGCTTTG ATCCAGAGAT TTC CGAATGG GGAAACC CAG CATGAGTTAT 
GTCATG 

(2) INFORMATION FOR SEQ ID NO: 4923: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4923: 
GTGGAGGCGC TGGTGGGATA CTACCCTAGC TGTGTTGGCT TTCTAACCCG CAACCACTTA 
TCGTGGG TGG GAG 

(2) INFORMATION FOR SEQ ID NO: 4 924: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4924: 
GAGGTCAGCG GTTCGATCCC GCTAGTCTCA C CATTTATTT TTTACACGAT GACATGAAAC 
TGATGACATA TGCACCGTAA TTCCAAAAA 
(2) INFORMATION FOR SEQ ID NO: 4925: 



3122 



EP0 786 519 A2 



(A) LENGTH: 68 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4925: 
AATGGCATCG GAGTTGTCTG AATTCGGTAA CCCGAGAGGC CCCTCGTCCA AACAGGCTCT 
ACCTCCAA 

(2) INFORMATION FOR SEQ ID NO: 4926: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 926: 
ACGAAAGGCG TAACGATTTG GGCACTGATC TCGACGAGAG ACTCGTGAAA ATCATAGATA 
CCTGTGAAGA TGC 

(2) INFORMATION FOR SEQ ID NO : 4 927: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 927: 
ATTTCTATTA ATTATTTGCA TAGAAATCAG CTTTTTTGAT ATGTATTTTA TAATGTACAG 
CTCGTTGAG 

(2) INFORMATION FOR SEQ ID NO: 4 928: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4928: 
AGGTATGTCC ATCCCGGTCC TCTCGTACTA AGGACATCTC CTCTCAAATT TCCTACGCCC 
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(2) INFORMATION FOR SEQ ID NO: 4929: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4929: 
GGCGGGGCCC CAACACAGAG GCTGGCGGAA AGTGCAGCTT ACAATAGTGT GCAAGTTGGG 
GTGGGTCCCG ACACAGAGAA ATT 
(2) INFORMATION FOR SEQ ID NO: 4930: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 930: 
CTTGGTAGAG CACTTG GTTT GGG AC CAAGG GGT CGCAGTT CGAATCCTGT CTTCCCGATT 
ACTCTA 

(2) INFORMATION FOR SEQ ID NO: 4 931: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 931: 
AAGAAGCTTG TTATGCTGCA ACACCAGCAA TTCAAGTAGC TAAAGATTAT TTAGCAACTA 
GACCG 

(2) INFORMATION FOR SEQ ID NO: 4 932: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TGCAGAAGAG GAAAGTGGAA TTCCATGTGT AGGGTGAAAT GCGCATAGAT ATGG CAGGAA 
CACCAGTGGG CGA 

(2) INFORMATION FOR SEQ ID NO: 4933: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4933: 
TCGCTATCCG AGTCTGGGTC GCTATCTGAG TCTGAGTCGC TATCTGAGTC TGAAATCGCT 
GTCTG 

(2) INFORMATION FOR SEQ ID NO: 4934: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4934: 
GAGCATGTGG GTTTAATTCG AAGCAACGCA GAGAACCTTG AC CAAAT CTT GGACATCCnT 
TGACAACTCT GAGAGATTAG AGCCTTGCCC CTTGCGGGGT AACAAATGTA CAGGTTGGTG 
(2) INFORMATION FOR SEQ ID NO: 4935: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 935: 
GAAGGCTCGT CCGCTCTGGG TTAGTCGGGT CCTAAGCTGA GGCCGAGCAG GTACGGCGAT 
GGATAACGGT TG 

(2) INFORMATION FOR SEQ ID NO: 4936: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 936: 
CACATCAGCG TCAGTTACAT ACCAGAAAGT CGCCTTCGCC ACTGGTGTTC CTCCATATCT 
CTGC 

(2) INFORMATION FOR SEQ ID NO: 4 937: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4937: 
ATCACCTGGT GTAAACCAAT TAGTACGTGT ATATATCGTT CAAAACCGTA AAATTCATGT 
TGGT 

(2) INFORMATION FOR SEQ ID NO: 4 938: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 3 base pairs 
!B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4938: 
CTGGGTAGCT ATGTGTGGAC GGGATAAGTG CTGAAACATC TAAGCATGAA GCCCCCTCAA 
GAT 

(2) INFORMATION FOR SEQ ID NO: 4 939: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4939: 
TAGAAATCAG CTTTTTTGCA TATGTATTTT ATAATGTACA GCTCGTTGAG CTGCTATTTT 
CCT 

(2) INFORMATION FOR SEQ ID NO: 4 940: 
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(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4940: 
GGTAGTCGAA CTTACGTTCC GCTAGAGTAG AACTTGCCAG CAAATGACAA ATCGGAGAAT 
TAGCTCAG 

(2) INFORMATION FOR SEQ ID NO: 4 941: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4941: 
CTGAAACATC TTAGTACCCG GAGAAGAGAA AGAAAATTCG ATTCCCTTAG TAGCGGCGAG 



(2) INFORMATION FOR SEQ ID NO: 4 942: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4942: 
TTACAACAAT GCAATTGGCG GGGCCCAACA TAGAACTGGC AATAGTTACT TTCAATAATG 
TGCAAGTTGG GGTAGGGCCC AACACAGAA 
(2) INFORMATION FOR SEQ ID NO: 4 943: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4943: 
AATGGACAAT ATGTCAACGT TAATTCCAAA AAACGGTAAC TATAATTTAC AAACATTATT 
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(2) INFORMATION FOR SEQ ID NO: 4944: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4944: 

CCCGTGAAAG ATGATGAGGT TAATAGGTTT GAGGTGGAAG CATGGTGACA TGTGGAGCGT 6 0 

J 5 GACGAATACG TAATTGA 7 7 

(2) INFORMATION FOR SEQ ID NO: 4945: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 945: 
GAATAGGTC G ATGTATTTGG CGACCGAAAC CAGGAATCTA CCCTTGGTCA GGTTGAAGTT 
CAGGTAACAC TGAAT 

(2) INFORMATION FOR SEQ ID NO: 4946: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 946: 

ATCAGAAGGC ACACCCGAGA ATGAAACATC TAGTACCCGG AGAAGAGAAA GAAATTCGAT 6 0 

TCCCTATAGC GGCGnCGAAA CGGAAGAGCC AACCACAGCT GCTTTGGTTT GACCCATCGG 12 0 

TACAGCGCTG ACACT 13 5 
(2) INFORMATION FOR SEQ ID NO: 4 947: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4947: 
GATTTGGGCT CTTC CATTTC GCTCGCCGCT ACTAAGGGAA TCGAATTTCC TTTCTCTTCC 



(2) INFORMATION FOR SEQ ID NO: 4 948: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4948: 
GCTTTTGAGG AATTTAATAT CAATATATAT TCTGTGTATG AAAATATTTT TCATAAAAAT 
TGTTTGAATC 

(2) INFORMATION FOR SEQ ID NO: 4 949: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4949: 
CTTCGACCTG ACCAAGGGTA GATCACCTGG TTTCGGGTCT ACGACCAAGT ACTAAACGCC 
CTATTC 

(2) INFORMATION FOR SEQ ID NO: 4 950: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4950: 
GGGTGTTACA AAATATCGTG GTGTGACGGG GCGGTGTGTA CAAGACCCCG GGTACGTTTT 
TCACCGTAGG CATGCTGG 

(2) INFORMATION FOR SEQ ID NO: 4951: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 61 base pairs 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4951: 
CCGCGATAAT AAAAAATAAT GGCGGAGGAA GAGGGATTCG AACCCCCGTG GCCCGTTAAG 



(2) INFORMATION FOR SEQ ID NO: 4 952: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4952: 
ACTTATCTAG TTTTCAATGT ACAATGTCTT TTTAGTCAGC GCTCGCATAA G CAAT ATCAC 
TGTAACCA 

(2) INFORMATION FOR SEQ ID NO: 4953: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 953: 
ACTGACGCTG ATGTGCGAAA GCGTGGGGAT CAAACAGGAT TAGATACCCT GGTAGTCCA 
(2) INFORMATION FOR SEQ ID NO: 4954: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4954: 

CGTGAGCGCT AGCTGTACGT GGAGGCGCTG GTGGGATACT ACCCTAGCTG TGTTGGCTTT 

(2) INFORMATION FOR SEQ ID NO: 4 955: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 59 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 955: 
AAGGCGTAAC GATTTGGGCA CTGTCTCAAC GAGAGACTCG GTGAAATCAT AGTACCTGT 
(2) INFORMATION FOR SEQ ID NO: 4956: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 9 56: 
AATAGGTG CC CGTACCGCAA ACCGACACAG GTAGTCAAGA TGAGAATTCT AAGGTGAGC 
(2) INFORMATION FOR SEQ ID NO: 4957: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 957: 
ACACGTTTAG TTCACGCGGG TTTTGCTTGG TAAAATCTAT ATTTTACTTA CTTATCTAGT 
TTTCAATGTA CAATTTC 

(2) INFORMATION FOR SEQ ID NO: 4958: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4958: 
TGTCGTCAGC TCGTGTCGTG AGATGTTGGG TTAAGTCCCG CAACGAGCGC AACCCTTAA 
(2) INFORMATION FOR SEQ ID NO: 4959: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4959: 
GACTTGAACC CCCAACCTAC TGATTACAAG TCAGTTGCTC TACCAATTGA GCTAGGCCG 5 9 

(2) INFORMATION FOR SEQ ID NO: 4960: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4960: 
TATTAACTGG TGACTTAGCG AGATGGACAA CGATGGCGAT ATATTTATTA TTGACCGCA 59 
(2) INFORMATION FOR SEQ ID NO: 4961: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4961: 
ATGATTATCA TATTGCCTTA AATATCGCCA CAATTTTAGC GGGTGGTGAT TTACCAAGA 5 9 

(2) INFORMATION FOR SEQ ID NO: 4962: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4962: 
CGCCGAAAAT TGTACCGGGG CTAAACATAT TACCGAAGCT GTGGATTGTC CTTTGGACA 5 9 

(2) INFORMATION FOR SEQ ID NO: 4963: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



55 
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TTAAGCTTAG TTGCCATCAT TAAGTTGGGC ACTCTAATGT TGACTGCCGG TGACAAACCG 
(2) INFORMATION FOR SEQ ID NO: 49G4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49S4 : 
TAGTTTTGGT ACGGTAACAA ATTTATTTGA AGAAACAATC TCAAATAAAG AATTGTTTG 
(2) INFORMATION FOR SEQ ID NO: 4965: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4965: 
AGTTCGGTCC CTATCCGTCG TGGGGCGTAG GAAATTTTAG AGGAGCTGTC CTTAGTACG 
(2) INFORMATION FOR SEQ ID NO: 4966: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4966: 
TACAAAGGGC AGCGAAACCG TGAGTCAAAG CAAATCCCAT AAAGTTGTTC TCAGTTCGG 
(2) INFORMATION FOR SEQ ID NO: 4967: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 967: 
CCATTGCAGC ACCGACAGCC ACACCATTTA TTTTTGCGCT ATGTATATCG CATTAAACGT 
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(2) INFORMATION FOR SEQ ID NO: 4968: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 60 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4968: 
GCCTCCGTTA CTTTTAGAGG CGACGCCCAG TCAAACTGCC CGCTGACACT GTCTCCCACC 
(2) INFORMATION FOR SEQ ID NO: 4 969: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4969: 
CAAGGAATTT CGCTACCTTA GGACCGTTAT AGTTACGGCC GCCGTTTACT GGGGCTTCG 
(2) INFORMATION FOR SEQ ID NO: 4 970: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4970: 
CATCTGTCAC AGGTACTATG GATTTCACCG GAGTCTCTCG TTGAGACAGT GCCCAAATCG 
TTACG 

(2) INFORMATION FOR SEQ ID NO: 4971: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4971: 
CTAGAGTTGG CATACCAGTC CGTCGCTAAG GAACCTTTCT TGACTTGATG ACAATCGACT 
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(2) INFORMATION FOR SEQ ID NO: 4972: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4972: 
AGACCCCGTG GAGCTTTACT GTAGCCTGCA TATTCGAATT CGGCACAGCT TGTACAGGAT 
AGGTAGG 

(2) INFORMATION FOR SEQ ID NO: 4973: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4973: 
TGTGTACTTA AAAATATGAA TACATGAGTA AAACTCATGC ATAAGAAATA CTAATTTC 
(2) INFORMATION FOR SEQ ID NO: 4974: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4974: 
ACCTCCAATA ATCATCACTT GAGGCTAGCC CTAAAGCTAT TTCGGAGAGA ACCAGCTA 
(2) INFORMATION FOR SEQ ID NO: 4975: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4975: 
GACGCCACAT CCTTTTCCAC TTGGCGTGTG TTTTGGGACC TTGGCTGGTG GTCTGGGCTG 
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(2) INFORMATION FOR SEQ ID NO: 4976: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4976: 
TCACTAAGTC CGTCTTTCGA CCCTGACTCG ACTTGTAGGT CTCGCAGTCA AGCTCCCTT 5 9 

75 (2) INFORMATION FOR SEQ ID NO: 4 977: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4977: 

25 

TAAGTAAGTA AAATATAGAT TTTACCAAGC AAAACCGAGT GAATAAAGAG TTTTAAAT 58 

(2) INFORMATION FOR SEQ ID NO: 4978: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4978: 
GTGAGGTACG TCCAGAATGG GGATTGGCTA AAAATGCATC ATTTATAATT GGACGACG 58 
(2) INFORMATION FOR SEQ ID NO: 4979: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4979: 
TCGCTATCTG AATCCGAGGT CGCTATCCGA ATCTGGAGTC GCTATCTGGA GTCTGGAATC 60 
(2) INFORMATION FOR SEQ ID NO: 4980: 
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(A) LENGTH: 58 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 980: 
AAGAAGGTAA TAATCCTGTA GTCGAAAATG TTGTCTCTCT TGAGTGGATC CTGAGTAC 
(2) INFORMATION FOR SEQ ID NO: 4 981: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY -. linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 981: 
GTGGTGTGTT AGGGCACTCT ATACGGGTTA CAAAGTACGA CATTAGACGG ATCATCTGGA 
AAGATGAATC AAAGT 

(2) INFORMATION FOR SEQ ID NO: 4982: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4982: 
GTTTTTCATT TTCATTTATA TTTATATTAG TGTTAATCCA ATCATAGATT TATCTATA 
(2) INFORMATION FOR SEQ ID NO: 4983: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4983: 
TGACAGCTAA CTATTGGCCA GCTTCTATGT TGGGGCCCCG CCAACTTGCA TTGTCTGTAG 



(2) INFORMATION FOR SEQ ID NO: 4984: 
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(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4984: 
GGTGTTTTTA AGTCTGATGT GAAAACCCAC GGCTCAACCG GTAGAGGTCA TTGGAAACTG 
GAAA 

(2) INFORMATION FOR SEQ ID NO: 4985: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4985: 
CAGCATACAA TAATGTGCAA GTTGGTCGGG GCCCCAACAT AGAGAATTTC AAAAGAAAT 
(2) INFORMATION FOR SEQ ID NO: 4 986: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4986: 
TGATGAAGGT CTTCGGATCG TAAAACTCTG TTATTAGGGA AGAACATATG TGTAAGT 
(2) INFORMATION FOR SEQ ID NO: 4987: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4987: 
AAAGAATTTG CGCAAAACGC TATCGATACT GAAGGGCGTT CAATGATTAT CATGGGT 
(2) INFORMATION FOR SEQ ID NO: 4988: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4988: 
TTGTCACAAG TCAAGAAAGG TCTTTAGCGA CGATGGTAGC CAACTTACGT TCCGCTA 
(2) INFORMATION FOR SEQ ID NO: 4989: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4989: 
TAGGACCGTT ATAG TTACGG CCGCCGTTTA CTGGGGCTTC GATTCGTAGC TTCGCAG 
(2) INFORMATION FOR SEQ ID NO: 4990: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4990: 
AAAGTCAACG TTAATTCCAA AAAACGTAAC TATAAGTTAC TAACATTATT TAGTATTTAT 



(2) INFORMATION FOR SEQ ID NO: 4991: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4991: 
TCCACGGGGT CTTTCCATCC TGTCTCGGGT AACCTGCATC TTCACAGGTA CTATGATTTC 
ACCGA 

(2) INFORMATION FOR SEQ ID NO: 4992: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 57 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4992: 
GTGCGGCTGG ATCACCTCCT TTCTAAGGAT ATATTCGGAA CATCTTCTTC AGAAGAT 
(2) INFORMATION FOR SEQ ID NO: 4993: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4993: 
CCAATGCGGC TCATCGCATC CATTTTTTGC CTGGCAACGT TCTACTCTAG CGGAACG 
(2) INFORMATION FOR SEQ ID NO: 4994: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4994: 
CTGGTTTCGG GTCTACGACA AATACTAAAC GCCTATTCAG ACTCGCTTTC GCTACGG 
(2) INFORMATION FOR SEQ ID NO: 4995: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4995: 
CAAACTCTCG TGGTGTGACG GGCGGTGTGG TACAAGCCCC GGGAACGTAT TCACCGTAGC 
ATGCCGGTCT ACG 

(2) INFORMATION FOR SEQ ID NO: 4996: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4996: 
ACTAATTTC T ATAGGAAAAG TATTCTTTAT CGTCGTCCCA CCCCAACTCG CACATTATTG 
TAAGC 

(2) INFORMATION FOR SEQ ID NO: 4997: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4997: 
ATTGGATTCC CAATTTCTAC AGACAATGCA GTTGGGGTGG GACGTCGAAA TAAATTTT 
(2) INFORMATION FOR SEQ ID NO: 4998: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4998: 
AACTTCAACA GGTTTTGCAG GTGGCGGTGC GACTGCAGGA AGACGTTAAA TTAATG 
(2) INFORMATION FOR SEQ ID NO: 4 999: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4999: 
GGGCAGGCGT CAGCCCTATA CATCACCTTA CGGTTTAGCA GAGACCTGTG TTTTTG 
(2) INFORMATION FOR SEQ ID NO: 5000: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5000: 
TTAGGACCCG ACTAACCCAG AGCGGACGAG CCTTCCTCTG GAAACCTTAG TCAATC 
(2) INFORMATION FOR SEQ ID NO: 5001: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5001: 
GACGGGATTC TCACCCGTCT TTCGCTACTC ACACCGG CAT TCTCACTTCT AAGCGC 
(2) INFORMATION FOR SEQ ID NO: 50 02: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5002: 
CCTGTCGGCC TCAGCTTAGG ACCCGACTAA CCCAGAGCGG ACGAGCCTTC CTCTGG 
(2) INFORMATION FOR SEQ ID NO: 5003: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5003: 
TTACTCATTT AGCTCTACTA AACTCGTTGC GCTCTTTTCT CGTTTCGTCA GATTCA 
(2) INFORMATION FOR SEQ ID NO: 5004: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5004: 
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(2) INFORMATION FOR SEQ ID NO: 5005: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5005: 
GACATAGATT AAGTTATTAA GGGCGCACGG TGGATGCCTT GGCACTAGAA GCCGATG 57 
15 (2) INFORMATION FOR SEQ ID NO: 5006: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5006: 
GGAACCACTA CATAATAAAT CATTAGTGGC TCTTTATCAT TCTGTCCCAC TCCCCTG 
(2) INFORMATION FOR SEQ ID NO: 5007: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5007: 
CGCTACGTTT TTCCTCAACA TTTAAGAAAA TAAAGAATGC TACAATTACG ATTGCAATCA 
AACCAAAGA 

(2) INFORMATION FOR SEQ ID NO: 5008: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5008: 
AAAAGCTTGT TACAAGCGCT ATTTTCGTTC AGTCAACTAC TGCCAATATA ACTTCGT 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5009: 
ACATCTTAGT ACCCGGAGAA GAGAAAGAAA ATTCGATTCC CTTAGTAGCG GCGAGC 
(2) INFORMATION FOR SEQ ID NO: 5010: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5010: 
GTTTGGACGA GGGGCCCTCT CGGGTTACGA TTCAGACAAA CTCCGATGCA ATTAATTTAC 
TTGGG 

(2) INFORMATION FOR SEQ ID NO: 5011: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5011: 
TTTACTTGTA TTGGAATAAT TTCATCTTTG AACCGACCAT CACGTTGTGC GTCATA 
(2) INFORMATION FOR SEQ ID NO: 5012: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5012: 
CCTTATACAT CACCTTTACG GTTTTAGCAG AAACCTGTGT TTTTTGATAA ACAGTCG CTT 
(2) INFORMATION FOR SEQ ID NO: 5013: 
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(A) LENGTH : 6 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5013: 
GGGCCCAACA CAGAAGCTGG CGAAAAGTCA GCATACAAAA TTGTGCAAGT TGGCGGGTCC 
ACA 

(2) INFORMATION FOR SEQ ID NO: 5014: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5014: 
TAGGGTAGTA TCCCACCAGT GGCCTCCACG TAAGCTAGCG CTTCACGTTT CAAAGGCT 
(2) INFORMATION FOR SEQ ID NO: 5015: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5015: 
CAACTCCGAT GCAATTAATT TAACTTGGGA nCAGACATGG GTGATAAGGT CGTGTTCGAA 
GGGAACAGCC AGACACAGCT AAGTC CAG AT ATATGTAAGT GGAAA 
(2) INFORMATION FOR SEQ ID NO: 5016: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5016: 
CAAACGATTT ATTTATATGC TTATTTCTTT ATTTATTATT ATTACAATTA CATTTT 
(2) INFORMATION FOR SEQ ID NO: 5017: 
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(A) LENGTH: 65 base pairs 

(B) TYPE : nucl ic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5017: 
TCTGTAGAAA TTGGGGAATC CAATTTCTCT TGGTTTGGGT CCCATCCCCA ACTTGCACAT 
TATTG 

(2) INFORMATION FOR SEQ ID NO: 5018: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5018: 
GGGGATGGGC CCCAACAAAG AGAAATTGGA TTCCCAATTT CTACAGACAA TGCAAC 
(2) INFORMATION FOR SEQ ID NO: 5019: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5019: 
CGATT GGATT GCACGTCTAA GCAGTAAGGC TGAGTATTAG GCAAATCCGG TACTC 
(2) INFORMATION FOR SEQ ID NO: 5020: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5020: 
TACGGCGTTT AGTATTTGGT CGTAGCCGGA AACCAGGTGA TCTACCCTTG GTCAGGTT 
(2) INFORMATION FOR SEQ ID NO: 5021: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5021: 
TTGTGCGGGT CCCCGTCAAT TCCTTTGAGT TTCAACCTTG CGGTCGTACT CCCCA 
(2) INFORMATION FOR SEQ ID NO: 5022: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5022: 
AGAGTTCACA TCGACGGGGA GGTTTGGCAC CTCGATGTCG GCTCATCGCA TCCTG 
(2) INFORMATION FOR SEQ ID NO: 5023: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5023: 
CCTTAGCTGG TGGTCTGGGC TGTTTCCCTT TCGAACACGG ACCTTATCAC CCATGGTC 
(2) INFORMATION FOR SEQ ID NO: 5024: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5024: 
TAAAGGCTAA ACTACCAATG TTTTCAATGG ATTTCCAAAA TGAATCATCT GGGAT 
(2) INFORMATION FOR SEQ ID NO: 5025: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 025: 
TCAAACCGGA GGGGTCATTG GAAACTGGAA AACTTGAGTG CAGAAGAGGA AAGTGGAATT 
CCA 

(2) INFORMATION FOR SEQ ID NO: 5026: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5026: 
GCTTGCGTAC CTAAAGAGCA CCCCTTCTCC GAATTACGGG GTCATTTTGC CGAGTTCCTT 60 
20 AACG 64 

(2> INFORMATION FOR SEQ ID NO: 5 027: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(i) 

25 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5027: 
AATA CAT AG C ATATCATAAG GCACACCCGG AGAACTGAAA CATCTTAGTA CCCGGA 56 
35 (2) INFORMATION FOR SEQ ID NO: 5028: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5028: 

45 

GGGCCCCTCT CGGGTTACCA ATTCAGACAA ACTCCGAATG CCAATTAATT TAACT 55 
(2) INFORMATION FOR SEQ ID NO: 5029: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5029: 
CGTAACTATA ACGGTCCTAA GGTAGCCGAA ATTCCTTGTC GGGTAAGTTC CGAC 54 
(2) INFORMATION FOR SEQ ID NO: 5030: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503 0: 
ATCCCCGGGG CCCCAACACA GAGAATTTCG AAAAGAAATT CTACAGGCAA TGCAAGT 
(2) INFORMATION FOR SEQ ID NO: 5031: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5031: 

3Q TGCTTGGTAA AATCTATATT TTACTTACTT ATCTAGTTTT CAATGTACAA TTTC 54 

(2) INFORMATION FOR SEQ ID NO: 5032: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 base pairs 
35 (b) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5032: 
TCCCTATCCG TCGTGGGCGT AGGAAATTTG AGAGGAGCTG TCCTTAGTAC GAGA 54 
(2) INFORMATION FOR SEQ ID NO: 5033: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5033: 

55 
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(2) INFORMATION FOR SEQ ID NO: 50 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5034: 
TATCTGAATC TGAGTCGCTG TCTGAATCTG AATCGCTATC CGAGTCTGAG TCGC 
(2) INFORMATION FOR SEQ ID NO: 5035: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5035: 
CACCAGCAAT TCAATTAGCT AAAGATTATT TAGCAACTAG ACCGGAATGA AAAA 
(2) INFORMATION FOR SEQ ID NO : 5036: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5036: 
CTACCATCGA CGCTAAGAGC TTAACTTCTG TGTTCGGCAT GGGAACAGGT GTGA 
(2) INFORMATION FOR SEQ ID NO: 5037: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5037: 
CGGCGAGTTA CGATTTGATG CAAGGTTAAG CAGTAAATGT GGAGCCGTAG CGGAAG 
(2) INFORMATION FOR SEQ ID NO: 5038: 
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(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 038: 
GACCGGGATG GACATACCTC TGGTGTACCA GTTTGTCGTG CCAACGCATA AGCT 
(2) INFORMATION FOR SEQ ID NO: 503 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5039: 
TTTCACCGAG TCTCTCGTTG AGCAGTGCCC AAATCGTTAC GCCTTTCGTG CGGG 
(2) INFORMATION FOR SEQ ID NO: 5040: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5040: 
TTATAACACG TATGCTTGGG GAGTGTAATA AGCTTGATCA GAGATTCCnA TGGGAACCAG 
CATGAGTTAT GTCATTTATC ATATGTGATC ATAGCATATC AAAGG 
(2) INFORMATION FOR SEQ ID NO: 5041: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5041: 
GGCACCTATT TTCTATCTAG AGGCT TTT CT CGGCAGTGTG GAAATCAACG ACTC 
(2) INFORMATION FOR SEQ ID NO: 5042: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 base pairs 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5042: 
ACTTATCTAG TTTTCAATGT ACAAATAATG GTGGAGACTA GCGGGATCGA ACCG 
(2) INFORMATION FOR SEQ ID NO: 5043: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5043: 
CGTTCAGTCA ACTACTGCCA ATATAACATC GTAGAGCATA GAACATTGAT TTA 
(2) INFORMATION FOR SEQ ID NO: 5044: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5044: 
ACTAGAGAGT ATTT AG A CTT AGGAGTGGTC CTCCCAGATT CCGACGGATT TCACGTGCT 
(2) INFORMATION FOR SEQ ID NO: 5045: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5045: 
TTTCGTCGTC CCACCCCAAC TTG CACGCTA TTGTGAGCTG ACTTTTCGCC AGCTTCTGTG 



(2) INFORMATION FOR SEQ ID NO: 5046: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504 6: 
CGTCATCCCC ACCTTCCTCC GGTTTGTCAC CGGCAGTCAA CTTAGAGTGC CCA 
(2) INFORMATION FOR SEQ ID NO: 5047: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504 7: 
TATTTATTAA ACCTAATAAA GATGAATTAG AAGTGATGTT TAATACAACA GTG 
(2) INFORMATION FOR SEQ ID NO: 504 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5048: 
CCTCGGCACC ATTTTCAATA AAAACATATG CGCCCGTAGC TCAATTGGAT AGA 
(2) INFORMATION FOR SEQ ID NO: 5049: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5049: 
AAATGGCCAA CTTAAGCCAG GATACAATTT ACAAATAGCG ACAAATTCTC AATGTGTTT 
(2) INFORMATION FOR SEQ ID NO: 5050: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



3153 



EP0 786 519 A2 



CGAAAGACAC nCACAAGATT AATAACGCGT TTGAGTCTTT TTATAAAGAC GTACTTCATG 60 
TTACGTTGAC TTTAAGATGG TGGAACATAG TTAGTTTTAG GGCCACGGGG G 111 
(2) INFORMATION FOR SEQ ID NO : 5051: 



(i) SEQUENCE CHARACTERISTICS: 
[A) LENGTH: 53 base pairs 
!B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5051: 
ACGTTGACAT ATTGTCATTC AGTTTTCAAT GTTCATTAAT GTTCAATCTC TTT 
(2) INFORMATION FOR SEQ ID NO: 5052: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5052: 



CTGAATTCGG TAACCCGAGA GGGCCCCTCG TCCAAACAGT GCTCTACCTC CAA 53 

30 

(2) INFORMATION FOR SEQ ID NO: 5053: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5053: 
TACATAGCAT ATCAGAAGGC ACACCCGGAG AACTGAAACA TCTTAGTACC CGG 
(2) INFORMATION FOR SEQ ID NO: 5054: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5054: 



55 
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(2) INFORMATION FOR SEQ ID NO: 5055: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5055: 
TAGGGCTAGC CTCAAGTGAT GATTATTGGA GGTAGAGCAC TGTTTGGACG AGG 
(2) INFORMATION FOR SEQ ID NO: 5056: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5056: 
ACTGCTTTGA AATGTTGTTG TCTCAGTTAA TATTCAATTG CTTTT CTTTT GCATAATTGA 
CAA 

(2) INFORMATION FOR SEQ ID NO: 5057: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5057: 
CTATGTTGGG GCCCCGCCAA CTTGCATTGT CT GTAGAATT CCTTTTCGAA ATTC 
(2) INFORMATION FOR SEQ ID NO: 5058: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5058: 
CTGAATCCGA GTCGCTATCC GAGTCTGAGT CGCTATCTGA GTCTGAATCG CTG 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5059: 
TTTGGGCTAC ACACGTGCTA CAATGGACAA TACAAAGGGC AGCGAAACCG TGATGTCAA 
(2) INFORMATION FOR SEQ ID NO: 5060: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 0: 
AAATAACTAA TAACATTTAG AGG CTGGGAC ATAAATCCCT AAAAAACAGC AGT 
(2) INFORMATION FOR SEQ ID NO: 5061: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5061: 
TTAAAGAGTG CGTAATAGCT CACTAATCGA GTGACAATGG CGCCGAAAAT GTACC 
(2) INFORMATION FOR SEQ ID NO: 5062: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5062: 
TACCAGGGTA TCTAATCCTG TTTGATCCCC ACGCTTTCGC ACATCAGCGT CA 
(2) INFORMATION FOR SEQ ID NO: 5063: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 52 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5063: 
TCGCCAAGCC ATTTTTCTTT GTGTTTCTTT TTATTTTGAC GTTTTAGACA TA 
(2) INFORMATION FOR SEQ ID NO: 5064: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5064: 
CTCAAGCTCC CTTGTGCCTT TACACTCTAT GTGTGCTTTC CGAC CGTTCT GGGGGGAACT 
TTGGAGCGCC TCCGTT 

(2) INFORMATION FOR SEQ ID NO: 506 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5065: 
CCACCGCTTG TGCGGGTCCC CGTCAATTCC TTTGAGTCGC AACCTTGCGG TCGT 
(2) INFORMATION FOR SEQ ID NO: 5066: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5066: 
TCCCCGGGCT AAACATATTA CCGAAGCTGT GGATTGTCCT TTGGACAATG GTAGG 
(2) INFORMATION FOR SEQ ID NO: 5067: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5067: 
GCTCATCGCA TCCACTTTTT GCCTGGCAAC GTTCTACTCT AGCGGAACGT AA 
(2) INFORMATION FOR SEQ ID NO : 506 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5068: 

CGGACACCAG AAGTAC CAAG CGAGCCGGAA ACACCAACAC CACCGACACC AGACGTACAA 

GCGAGCGGAA ACAACAACAA CG 

(2) INFORMATION FOR SEQ ID NO: 506 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 52 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 506 9: 
TCCATAACGT GCTGTATCTG TAGCAATAAC TAATACTTTT TCATTCGGTC TA 
(2) INFORMATION FOR SEQ ID NO: 507 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5070: 
CGACTCGAAG ACTCAATGTC TTCTCCCCAT CACAGCTCAG CCTTAACGAG TA 
(2) INFORMATION FOR SEQ ID NO: 5071: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5071: 
GCGATGGATA ACAGGTTGAT ATTCCTGTAC CACCTATAAT CGTTTTAATC GA 52 
(2) INFORMATION FOR SEQ ID NO: 5072: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



1S (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5072: 

CACTTCACCA CAGCCGCCAT GG CATGGGCA GTAGGAATCG AACCCACACC AAA 5 3 

(2) INFORMATION FOR SEQ ID NO: 5073: 

20 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5073: 
GTCCAGACTC CTACGGGAGG CAGCAGTAGG GAATCTTCCG CAATGGGCGA AA 52 
(2) INFORMATION FOR SEQ ID NO: 5074: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5074: 
TGTTGGCTTA GAATCAGCCA TCATTTAAAG AGTG CGTAAT AGCTCACTAG TCGC 
(2) INFORMATION FOR SEQ ID NO: 5075: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5075: 



55 
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AAACTTCCCT TTGG 

(2) INFORMATION FOR SEQ ID NO: 5076: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5076: 

js TGAGCTAATC AGACATCATA ATTTTTATGG AGAGTTTGAT CCTGGCTCAA GAT 53 

(2) INFORMATION FOR SEQ ID NO: 5077: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 52 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5077: 
GTTAAGCTCC TTAGCGTCGA TGGTAGTCGA ACTTACGTTC CGCTAGAGTA GA 
(2) INFORMATION FOR SEQ ID NO: 5078: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5078: 
GGGTAAGTTC CGACCCGCAC GAAAGGCGTA ACGATTTGGG CACTGTCTCA AC 
(2) INFORMATION FOR SEQ ID NO: 50 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5079: 
ATTAATCCAT TGTGC CACAA GTGC CGGACC AGAAATGATG GTATTTAATA AT 



55 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5080: 
AATAGCGACG AATCACACCT TCTGCGGCAC CACGGAATAA TACACCATGT GG 
(2) INFORMATION FOR SEQ ID NO: 5081: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5081: 
CTTTCGCTAC TCACACCGGC ATTCTCACTT CTAAGCGCTC CACATGTCCT TA 
(2) INFORMATION FOR SEQ ID NO: 5082: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5082: 
CAGACAGCGA CTCAGATCAG ATAGCGATCG GACTCAGACA ACGACTCAGA TTCAGA 
(2) INFORMATION FOR SEQ ID NO: 5083: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE-, nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5083: 
AGGGTATGAT CCATCCCGGG TCCTCTCGGT ACTAAGGGAC AGCTCCTCTC AAATTTCCTA 



(2) INFORMATION FOR SEQ ID NO: 5084: 
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(A) LENGTH : 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5084: 
GTGTACCAGT TGTCGTGCCA ACGGCATAGC TGGGTAGCTA TGTGTGGACG AG 
(2) INFORMATION FOR SEQ ID NO: 5085: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5085: 
TTGCTTACGC TTGCTATAGC CAATATATAG ATGTTGGAGG GGGCAGATTC GAATGCGAAC 
CCGAGGAGCG GATTAACA 

(2) INFORMATION FOR SEQ ID NO: 5086: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5086: 
ACTGACGCTG ATGTGCGAAG CGTGGGGATC ACACAGGATT AGATACCCTG GT 
(2) INFORMATION FOR SEQ ID NO: 5087: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5087: 
AAGTTGTTCT CAGTTCGGAT TGTAGTCTGC AACTCGACTA CATGAAGCTG G 
(2) INFORMATION FOR SEQ ID NO: 5088: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 51 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5088: 
AATACTTTAA AAAAATAAGA CACTTTGGCC AACTTAAGCC AGGATACAAT T 
(2) INFORMATION FOR SEQ ID NO: 5089: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5089: 

20 

AGAACGTAAA TTTAATCCTG ATTTAGCACC AGGGACAGAA AAAGTAACAA G 51 

(2) INFORMATION FOR SEQ ID NO: 5090: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH : 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5090: 
CCGCTACCCT CAGTTCATCC GCTCACTTTT CAACGTAAGT CGGTTCGGTC C 51 
(2) INFORMATION FOR SEQ ID NO: 5091: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5091: 
CATGGGTGAT AAGGTCCGTG TTCGAAAGGG AAACAGCCCA GACCACCAGC T 51 
(2) INFORMATION FOR SEQ ID NO: 5092: 

SO (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5092: 
TTGTGCGGAT CCCCGTCAAT TCCTTTGAGT TTCAACCTTG CGGTCGTACT CCCCA 
(2) INFORMATION FOR SEQ ID NO: 5093: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5093: 
CCGCAAACCG ACACAGGTAG TCAAGATGAG AATTCTAAGG TGAGCGAGCG A 
(2) INFORMATION FOR SEQ ID NO: 5094: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5094: 
ACGAATACTA ATCGATCGAA GACTTAATCA AAATAAATGT TTTGCGACAA A 
(2) INFORMATION FOR SEQ ID NO: 5095: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 095: 
AATCGTTGTT ACCTTCATTA TACCACTTAC TTGTGGAGTG ATCTTCTTCA AAGTAACACT 
ATTGTGCCAC CGATTGA 

(2) INFORMATION FOR SEQ ID NO: 5096: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TTCGTAAGGA CATGTGGAGC GCTTAGAAGT GAGAATGCCG GTGTGAGTAG CG 
(2) INFORMATION FOR SEQ ID NO: 5097: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5097: 
GGGTCTGTTT TCTAATTTGA GCACAATCTT CGTTCTCAAT AGAATGATTT A 51 

15 

(2) INFORMATION FOR SEQ ID NO: 50 98: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 51 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5098: 
CTGTAGAAAT TGGGAATCCA ATTTCTCTAT GTTGGGGCCC ACACCCCAAC T 
(2) INFORMATION FOR SEQ ID NO: 50 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5099: 
CTGTGGATTG TCCTTTGGCA TGAGATAGGA GAGCGTTCTA AGGGCGTTGA A 
(2) INFORMATION FOR SEQ ID NO: 5100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5100: 
GGTTCAGAAC GTCGTGAGAC AGTTCGGTCC CTATCCGTCG TGGGCGTAGG A 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5101: 
TGACTGATAC GATCAATGCG CTTGTAACAA GCTTTTTTCA ATTCTAGTCA GGGGCCCCAA 
CACA 

(2) INFORMATION FOR SEQ ID NO: 5102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5102: 
TATTTGCATT TATATTGACG TTTTAAACAT AAAAAAAGAA ACCTTGCGGT CTCAATGG 
(2) INFORMATION FOR SEQ ID NO: 5103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5103: 
ATTACCATTT TGGTATACCA CATCGTTATT CAACACATTA T CTTTTACTT 
(2) INFORMATION FOR SEQ ID NO: 5104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5104: 
GGACACTGCT CCCT CAGGAG TCTCGCCATT AATACTACGT ATTAACATGT AAT 
(2) INFORMATION FOR SEQ ID NO: 5105: 



3166 



EP 0 786 519 A2 



(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5105: 
ATTTCTTCTT TTTCTACAAT CGAGTCTCCT TTTACAGGTC CATATTTTGT 
(2) INFORMATION FOR SEQ ID NO: 5106: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5106: 
TGGTCAAGCG GTTAAGACAC CGCCCTTTCA CGGCGGTAAC ACGGGTTCGA 
(2) INFORMATION FOR SEQ ID NO: 5107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5107: 
TTTTTAG CAG TAATTGCCAC ATCTGTGTGA CGATAATGAT ATGCGACAGT 
(2) INFORMATION FOR SEQ ID NO: 5108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5108: 
CAAGCCATTT TTCTTTGTGT TTACTTTTTA TTTTGACGTT TTAGACATAA 
(2) INFORMATION FOR SEQ ID NO : 5109: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5109: 
AGCCGATGAA GGACGTTACT AACGACGATA TGCTTTGGGG AGCTGTAAGT 
(2) INFORMATION FOR SEQ ID NO: 5110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5110: 
ATCCATAACG TTTGGTATCT GTAGCAATAA CTAATACTTT TTCATTCGGT CTA 
(2) INFORMATION FOR SEQ ID NO: 5X11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5111: 
CCCTTAGTAC TGCAGCTAAC GCATTAAGCA CTCCGCCTGA GGAGTACGAC CTC 
(2) INFORMATION FOR SEQ ID NO: 5112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5112: 
GGAGTCAAAG TCCGTTGCCT TACCGCTTGG CTATAGCC CA ATATATAGAT 
(2) INFORMATION FOR SEQ ID NO: 5113: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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AGAnTTTCTT TTCGAAATTC TTTATGTTGG GGCCCCGCCA ACTTGCATTG 
(2) INFORMATION FOR SEQ ID NO: 5114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5114: 

JS GCTGTGATGG GGAGAAGACA TTGTGTCTTC GAGTCGTTGA TTTCACACTG 50 

(2) INFORMATION FOR SEQ ID NO: 5115: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 55 base pairs 
20 (b) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5115: 
CTTGATTAAC TCATTATCAA GTTATGCACG TGTAAATGAA TTCGGCTTTA TCGAA 
(2) INFORMATION FOR SEQ ID NO : 5116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5116: 
GTACTTAAAA AATGAATACA TGAGTAAAGC TCATGGCATA AGAAATACTA 
(2) INFORMATION FOR SEQ ID NO: 5117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5117: 
CAGAATCAGA TAGCGACTCA GAATCAGATA GTGAGTCAGA TTCAGCAGTn 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5118: 
CGAATACTAA TCGATCGAAG ACTTAATCAA AATAAATGTT TTGCGACnAA 
(2) INFORMATION FOR SEQ ID NO: 5119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 0 base pairs 

(B ) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5119: 
AATCGTCGAA ACTTAATCAA AATAAATGTT TTGCGACAAA TCACTTTTAC TTACTATCTA 
(2) INFORMATION FOR SEQ ID NO: 5120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5120: 
CCGAAGTTTA CGGGGTCATT TTGCCGAGTT CCTTAACGAG AGTTCGCTCG 
(2) INFORMATION FOR SEQ ID NO: 5121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5121: 
ATAATCCTGT AGTCGAAAAT GTTGTCTCTC TTGAGTGGAT CCTTAGTACG CCGGGGCACG 



(2) INFORMATION FOR SEQ ID NO: 5122: 
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(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5122: 
TAGCGACTCA GATTCAGACA GCGATTCAGA CAGCGACTCA GACTCAGATA 
(2) INFORMATION FOR SEQ ID NO: 5123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 5123: 
TATTTGTAAA TTGTATCCTG GCTTAAGTTG GCCATTTTTC ATATGGTCTT 
(2) INFORMATION FOR SEQ ID NO: 5124: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5124: 
CACGAGCTGA CGACAACCAT GCACCACCTG TCACTTTGTC CCCCGAGAAG GTCTCTATCT 
(2) INFORMATION FOR SEQ ID NO: 5125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5125: 
AGAACACATT GCAGTCTTCG AGTCGTTGAT TTCACACTGC CGTAGTAAAA GCCTCTAGAT 
AGAA 

(2) INFORMATION FOR SEQ ID NO: 5126: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 57 base pairs 
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(C) STRANDEDNESS .- double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5126: 
GGACGACATT AGACGAATCA TCTGGAAAAG ATGAGTCAAA GAAGGTAATA GTCCTGT 5 7 

(2) INFORMATION FOR SEQ ID NO: 5127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5127: 

20 

GTC CTTAGTA CGAGATGACC GGGATGGACA TACCTCTGGT GTACCAGTTG 50 

(2) INFORMATION FOR SEQ ID NO: 512 8: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5128: 
GCCGATTTTA GCAGTTGTTG CTTCGTTCAA TTTTATGGGG CCATTTATGG 50 
35 (2) INFORMATION FOR SEQ ID NO: 5129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 9: 

45 

TCCACACATA GCTACCCAGT CTATGCCGTT TGCACTACAA CTGGTACACC A 51 
(2) INFORMATION FOR SEQ ID NO: 5130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5130: 
CTCAGTTGGT TGAGCATCTG ACTTTTAATC AAAGGGTCAA GGGTCGAATC CTCTTTT 
(2) INFORMATION FOR SEQ ID NO: 5131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5131: 
TGATATTCAC TTCAATGTTA TCAATATTAG TGCCATCTAT GACATCTGCC A 
(2) INFORMATION FOR SEQ ID NO: 5132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5132: 
TCCTGTAGTC GAACGTGTTG TCTCTCTTGA GTGGATCCTG AGTACGACGG A 
(2) INFORMATION FOR SEQ ID NO: 5133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5133: 
ACTCAGACAT TGGATTCAGA TTCAGACAGC GACTCAGATT CAGATAGCGA CTCAGATTC 
(2) INFORMATION FOR SEQ ID NO: 5134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5134: 
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(2) INFORMATION FOR SEQ ID NO: 513 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5135: 
CAATACnAAG GGCAGCGAAA CCGTGATTCA AGCAAATCCC ATAAAGTTGG 
(2) INFORMATION FOR SEQ ID NO: 5136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5136: 
GAAGCTGGCG AAAAGTCAGC TTACAAAATG TGCCAGTTGG CGGGGGCCCA AC 
(2) INFORMATION FOR SEQ ID NO: 5137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5137: 
TTGTAAGGAA GATGCTCTCC CAGCTGAGCT AATTCTCCAA AATAATGAnT 
(2) INFORMATION FOR SEQ ID NO: 5138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5138: 
CTGTACAAGC TGTGCCGATA TTTCAATATC AnGnTACAGT AnAGCTCCAC 
(2) INFORMATION FOR SEQ ID NO: 5139: 
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(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5139: 
GGTTCGAATC CTGTCTTCCC GATATACTGT AATTATTATG GGGGCTTTGC TC 
(2) INFORMATION FOR SEQ ID NO: 5140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 0: 
GCTACTGAAC CTATAAAATG TATGATGGCG GTCTCGAGGG AATCGAACCC AGATCT 
(2) INFORMATION FOR SEQ ID NO: 5141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5141: 
AGGAGG CTCG TCCGCTCTAG GTTAGTCGGG TCCTAAGCTG AGGCCGACAG TA 
(2) INFORMATION FOR SEQ ID NO: 5142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5142: 
CACGCTTTCG ACATCAGCGT CAGTTACAGA CCAGAAAGTC GCCTTCGCAC TGGTGT 
(2) INFORMATION FOR SEQ ID NO: 5143: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5143: 
CTCCTAAAAC AATTTACATC CAAACCTTCA TCACTCACGC GGCGTTGCTC CGTCAGCTTT 
CGCCATTGCG AAGAT 

(2) INFORMATION FOR SEQ ID NO: 5144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5144: 
GAGGAGTGGT TAGCTTCTGC GGAGTACGAA TCGAGCCCCG TAAACGGCGG CCGGT 
(2) INFORMATION FOR SEQ ID NO : 514 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 5: 
GCCTAATACT CAGCCTGACT GCTTAGACGT GCAATCCATA TCAGCACGAT TCG 
(2) INFORMATION FOR SEQ ID NO: 514 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5146: 
AGCTGTGGAT TGTCCTTTTG GCATGGGTAA GGAGAACGTT CTAAGGGCGT T 
(2) INFORMATION FOR SEQ ID NO: 5147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 514 7: 
CTCTTTGATC CGTAG TCAAA CGCTCTATCC AATTGAGCTA CGGGCGCATA 
(2) INFORMATION FOR SEQ ID NO: 5148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5148: 
GAACTAATTC TCCAAAATAA TGACTCCTAC AGGAACTCGA ACCCGTATAA CCGCCGTAAA 
(2) INFORMATION FOR SEQ ID NO: 5149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 9: 
AAAGCACACC CGGAAAACTG AAACATCTTA AGTACCCGGA GnAAGAGAAA 
(2) INFORMATION FOR SEQ ID NO: 5150: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5150: 
GTAAGTTCGC ACTACCATCG ACGCTAAGGG AGCTTAACTT CTGTGGTTCG GCATGG 
(2) INFORMATION FOR SEQ ID NO: 5151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5151: 
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(2) INFORMATION FOR SEQ ID NO: 5152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5152: 
CTATTCTCTG TGTCGGGCTC ACCCCAACTT GCACACTATT GTAAGCTGAC TTTCCTCCA 
(2) INFORMATION FOR SEQ ID NO: 5153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5153: 
TAACCACATC ATTCGGATAC TGTTCATGGA AGTAATTTCT CTATTCTTCG GACCAGGAAA 
ATACA 

(2) INFORMATION FOR SEQ ID NO: 5154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5154: 
CCATATGTGT CAGTCACTGT GCACATCTTG ACGGTACCTA ATCAGACAGC CACGGCTAAC 
TCCGTGCCAG CCGCCGCGGT ACTACGTGGT G 
(2) INFORMATION FOR SEQ ID NO: 5155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5155: 
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(2) INFORMATION FOR SEQ ID NO: 5156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5156: 
GACTCTGGGA GGACCATCTC CTAAGGCTAA ACTACTCTCT CGTGACCGAT AGTGAACC 
(2) INFORMATION FOR SEQ ID NO: 5157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5157: 
TTGTAACTCC GGTATAG GAG TGTCCTACAA CCCCAACAAG CAAG CTTGTT G 
(2) INFORMATION FOR SEQ ID NO: 5158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5158: 
TAGTnACCGn TAGTGAACCA TTACCGTGAG GGAAAGGTGA AAAGCACCCC 
(2) INFORMATION FOR SEQ ID NO: 5159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5159: 
TTCGCTCGCC GCTACTCAAG GGAATCGCAT TTCCTTTCTC TTCCTCCGGG T 
(2) INFORMATION FOR SEQ ID NO: 5160: 
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(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5160: 
AACTGGAGAA ACAACTTTAT GGGGTTTGCT TGAACCTCGC GGTTTCGCTG CCCTTTGTAT 



(2) INFORMATION FOR SEQ ID NO: 5161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5161: 
AGACCTCTGC CTTACCACTT GGCTATGCGC CAATAACTGG GCTACCTGGA T 
(2) INFORMATION FOR SEQ ID NO: 5162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5162: 
CCGCTCACTT TTCAACGTAA GTCGGATCGG TCCTCCATTC AGTGTCACCT GAACT 
(2) INFORMATION FOR SEQ ID NO: 5163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5163: 
CTTGATCTGT ATTTAAAATG ATATTTTCTA TCTTTTCTTT ATTATTAACG TCTATGACGT 
CGTAGTATAA GATTCCGTGT A 
(2) INFORMATION FOR SEQ ID NO: 5164: 
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(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5164: 
GTTCGAATCC CGTCTTCTGC TCCATTATTT TGCCGGGGTG GCGGCAACTC TCACACGCAC 



(2) INFORMATION FOR SEQ ID NO: 5165: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5165: 
CAGGGGTCTT TCGTCCTGTG TGGGTAACTG CATCTTCACA GGTACTATGA TTTCA 
(2) INFORMATION FOR SEQ ID NO: 5166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5166: 
AAGCTGAGGC CGACAGCGTA GGCGATGGAT GACAGGTTGA TATTGCCGGT ACCACCCGAT 



(2) INFORMATION FOR SEQ ID NO : 516 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5167: 
CTTGCGTCTC CAGTTCCGCC ACCCCGGCAC TATAAAAATG GAGCAGAAGA CG 
(2) INFORMATION FOR SEQ ID NO: 5168: 
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(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5168: 

TCGTCCGCTA TCTCCACCAT TATTTGTACA TTGAAACTAG ATAAGTAGTA AATATA 5 6 

(2) INFORMATION FOR SEQ ID NO: 516 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5169: 
CTTGGACGTC GGTGGGTAGT CGAAACTTAC GTTCCGCTAG AGTAGAACGT TGCCA 
(2) INFORMATION FOR SEQ ID NO: 5170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5170: 

35 CTACTGCTGC ATATGCGGGT GAAGGGAGTC GAACCCCCAC GCCGTAAGCA TA 52 

(2) INFORMATION FOR SEQ ID NO: 5171: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5171: 
GGGCTTnGGA CATTAAGTTC TnAGGCAATG TAAAAAAGCT GATTTCTATT 50 
(2) INFORMATION FOR SEQ ID NO: 5172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5172: 
AAATTACATG TTAATACGTA ATTAATGGCG AGACTCCTGA GGAACCATTG CCATTCGAAG 
(2) INFORMATION FOR SEQ ID NO: 5173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5173: 
GTTTTATTTT TTTAAAGTAT GTAAAAGTAA AATTACATG n TAATACGTAn 
(2) INFORMATION FOR SEQ ID NO: 5174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5174: 
CAACTCTCGT TAAGGAACTC GGCAAAATAC CCCGTAACTT CGGAGTAGGT CTCTTTA 
(2) INFORMATION FOR SEQ ID NO: 5175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5175: 
GGAAACTGGA ATAAATAAAC ATTCAATTGA GCGCATCAGT ATTAGGATTC ACTCTAAAAA 
(2) INFORMATION FOR SEQ ID NO: 5176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



55 
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GTAACGGACG CGCTCAAAGG TTCCCTCACA ATGGTTGGAA ATCATTCATA 
(2) INFORMATION FOR SEQ ID NO: 5177: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5177: 
AGTTACGTTC TTTTATAAAG AGGTTTGAGA CGCGTTATTA ATCTTGTGAG 
(2) INFORMATION FOR SEQ ID NO: 5178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5178: 
GTGTGGATCC TGAGTACGAC GGAGCACGTG ACAATTCCGT CGGCAATCTG GGG 
(2) INFORMATION FOR SEQ ID NO: 5179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5179: 
CGGTAACTTC ATACCTTTTA ACATATTTTG CATTTGATTG CGTTTACCTT TTTTACCTTT 
ACCGCCACCA GTGAACTGTT TCA 
(2) INFORMATION FOR SEQ ID NO: 5180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 5180: 
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(2) INFORMATION FOR SEQ ID NO: 5181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5181: 
GTTCGGTAAC TCGAGCGGGT CTCATAACCC GTAGGTCGGG TGGTTCAAAT CCGTCCTCCC 
GCAATAT 

(2) INFORMATION FOR SEQ ID NO: 5182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5182: 
TAGCGGTGAA ATGCGCAGAG ATATGGAGGA ACACCAGTAG CGAAGGCAAC TTTCT 
(2) INFORMATION FOR SEQ ID NO: 5183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5183: 
GAATTCCATG TGTAGCGGTG AAATGCGCAG AGATATGnAC GAACACCAAT 
(2) INFORMATION FOR SEQ ID NO: 5184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5184: 
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(2) INFORMATION FOR SEQ ID NO: 5185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5185: 
GAGACCTTGT CGGTCCATAT GCGGCTCATC GCATCCATTT TTTGCCTGGC AACGT 
(2) INFORMATION FOR SEQ ID NO: 5186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518 6: 
CCATCATTAA GTTGGGCACT CTAAGTTGAC TGCCGGnGnC AC CnAAGAAG 
(2) INFORMATION FOR SEQ ID NO: 518 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5187: 
TTAATACGTT GCAATCCAAT CGCACGCTTC GCCTATCCTA CTGCCnTCCC 
(2) INFORMATION FOR SEQ ID NO: 5188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5188: 
AGTCGCTATC CGAATCTGAG TCGCTATCTG AGTCTnAGTC GCTAnnCCAG 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518 9: 
ATAGATGCCC TTACCGCAAA CCGACACATG TAGTCAATAT GATAATTCTA AGGT 54 
(2) INFORMATION FOR SEQ ID NO: 5190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5190: 
CTTTAATGGG CGAACAGnAC CCTTGGACCG ACTACAGCCC AGATCGATGA 5 0 

25 

(2) INFORMATION FOR SEQ ID NO: 5191: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5191: 
ACATAG CAC C CAGCGAGCCG TTGGACGACA ACTGGTACAC CAGAGGATGC CATCCGGCCT 60 
(2) INFORMATION FOR SEQ ID NO: 5192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 320 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5192: 

Met Lys Met He Asn Lys Leu He Val Pro Val Thr Ala Ser Ala Leu 
15 10 15 
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20 25 30 

Leu lie Ser Ser Lys Ala Gly Asp Val Thr Val Ala Asp Thr Met Lys 
35 40 45 

Lys He Gly Lys Asp Gin He Ala Asn Ala Ser Phe Thr Glu Met Leu 
50 55 60 

Asn Lys lie Leu Ala Asp Lys Tyr Lys Asn Lys Val Asn Asp Lys Lys 
65 70 75 80 

He Asp Glu Gin lie Glu Lys Met Gin Lys Gin Tyr Gly Gly Lys Asp 
85 90 ' 95 

Lys Phe Glu Lys Ala Leu Gin Gin Gin Gly Leu Thr Ala Asp Lys Tyr 
100 105 110 

Lys Glu Asn Leu Arg Thr Ala Ala Tyr His Lys Glu Leu Leu Ser Asp 
115 120 125 

Lys He Lys He Ser Asp Ser Glu He Lys Glu Asp Ser Lys Lys Ala 
130 135 140 

Ser His He Leu He Lys Val Lys Ser Lys Lys Ser Asp Xaa Glu Gly 
145 150 155 160 

Leu Asp Asp Lys Glu Ala Lys Gin Lys Ala Glu Glu He Gin Lys Glu 
165 170 175 

Val Ser Lys Asp Pro Ser Lys Phe Gly Glu He Ala Lys Lys Glu Ser 
180 185 190 

Met Asp Thr Gly Ser Ala Lys Lys Asp Gly Glu Leu Gly Tyr Val Leu 
195 200 205 

Lys Gly Gin Thr Asp Lys Asp Phe Glu Lys Ala Leu Phe Lys Leu Lys 
210 215 220 

Asp Gly Glu Val Ser Glu Val Val Lys Ser Ser Phe Gly Tyr His He 
225 230 235 240 

He Lys Ala Asp Lys Pro Thr Asp Phe Asn Ser Glu Lys Gin Ser Leu 
245 250 255 

Lys Glu Lys Leu Val Asp Gin Lys Val Gin Lys Asn Pro Lys Leu Leu 
260 265 270 

Thr Asp Ala Tyr Lys Asp Leu Leu Lys Glu Tyr Asp Val Asp Phe Lys 
275 280 * 285 

Asp Arg Asp He Lys Ser Val Val Glu Asp Lys He Leu Asn Pro Glu 
290 295 300 

Lys Leu Lys Gin Gly Gly Ala Gin Gly Gly Gin Ser Gly Met Ser Gin 
305 310 315 320 



(2) INFORMATION FOR SEQ ID NO: 51 93: 
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(A) LENGTH: 33 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5193: 

Met Asn Lys Val He Lys Met Leu Val Val Thr Leu Ala Phe Leu Leu 
15 10 15 

Val Leu Ala Gly Cys Ser Gly Asn Ser Asn Lys Gin Ser Ser Asp Asn 
20 25 30 

Lys Asp Lys Glu Thr Thr Ser He Lys His Ala Met Gly Thr Thr Glu 
35 40 45 

He Lys Gly Lys Pro Lys Arg Val Val Thr Leu Tyr Gin Gly Ala Thr 
50 55 60 

Asp Val Ala Val Ser Leu Gly Val Lys Pro Val Gly Ala Val Glu Ser 
65 70 75 80 

Trp Thr Gin Lys Pro Lys Phe Glu Tyr lie Lys Asn Asp Leu Lys Asp 
85 90 95 

Thr Lys He Val Gly Gin Glu Pro Ala Pro Asn Leu Glu Glu He Ser 
100 105 110 

Lys Leu Lys Pro Asp Leu He Val Ala Ser Lys Val Arg Asn Glu Lys 
115 120 125 

Val Tyr Asp Gin Leu Ser Lys He Ala Pro Thr Val Ser Thr Asp Thr 
130 135 140 

Val Phe Lys Phe Lys Asp Thr Thr Lys Leu Met Gly Lys Ala Leu Gly 
145 150 155 160 

Lys Glu Lys Glu Ala Glu Asp Leu Leu Lys Lys Tyr Asp Asp Lys Val 
165 170 175 

Ala Ala Phe Gin Lys Asp Ala Lys Ala Lys Tyr Lys Asp Ala Trp Pro 
180 185 190 

Leu Lys Ala Ser Val Val Asn Phe Arg Ala Asp His Thr Arg He Tyr 
195 200 205 

Ala Gly Gly Tyr Ala Gly Glu He Leu Asn Asp Leu Gly Phe Lys Arg 
210 215 220 

Asn Lys Asp Leu Gin Lys Gin Val Asp Asn Gly Lys Asp He He Gin 
225 230 235 240 

Leu Thr Ser Lys Glu Ser He Pro Leu Met Asn Ala Asp His He Phe 
245 250 255 
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Val Val Lys Ser Asp Pro Asn Ala Lys Asp Ala Ala Leu Val Lys Lys 
260 265 270 

Thr Glu Ser Glu Trp Thr Ser Ser Lys Glu Trp Lys Asn Leu Asp Ala 
275 280 285 

Val Lys Asn Asn Gin Val Ser Asp Asp Leu Asp Glu lie Thr Trp Asn 
290 295 300 

Leu Ala Gly Gly Tyr Lys Ser Ser Leu Lys Leu He Asp Asp Leu Tyr 
305 310 315 320 

Glu Lys Leu Asn He Glu Lys Gin Ser Lys 
325 330 

(2) INFORMATION FOR SEQ ID NO: 5194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5194: 

Met Thr Arg Lys Phe Arg Thr Leu He Leu He Leu He Ala Thr He 
15 10 15 

Ala Leu Ser Gly Cys Ala Asn Asp Asp Gly He Tyr Ser Asp Lys Gly 
20 25 30 

Gin Val Phe Arg Lys He Leu Ser Ser Asp Leu Thr Ser Leu Asp Thr 
35 40 45 

Ser Leu He Thr Asp Glu He Ser Ser Glu Val Thr Ala Gin Thr Phe 
50 55 60 

Glu Gly Leu Tyr Thr Leu Gly Lys Gly Asp Lys Pro Val Leu Gly Val 
65 70 75 80 

Ala Lys Ala Phe Pro Glu Lys Ser Lys Asp Gly Lys Thr Leu Lys Val 
85 90 95 

Lys Leu Arg Ser Asp Ala Lys Trp Ser Asn Gly Asp Lys Val Thr Ala 
100 105 110 

Gin Asp Phe Val Tyr Ala Trp Arg Lys Thr Val Asp Pro Lys Thr Gly 
115 120 125 

Ser Glu Phe Ala Tyr He Met Gly Asp He Lys Asn Ala Ser Asp He 
130 135 140 

Ser Thr Gly Lys Lys Pro Val Glu Gin Leu Gly He Lys Ala Leu Asn 
145 150 155 160 
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Asp Glu Thr Leu Gin lie Glu Leu Glu Lys Pro Val Pro Tyr lie Asn 
165 170 175 

Gin Leu Leu Ala Leu Asn Thr Phe Ala Pro Gin Asn Glu Lys Val Ala 
180 185 190 

Lys Lys Tyr Gly Lys Asn Tyr Gly Thr Ala Ala Asp Arg Ala Val Tyr 
195 200 205 

Asn Gly Pro Phe Lys Val Asp Asp Trp Lys Gin Glu Asp Lys Thr Leu 
210 215 220 

Leu Ser Lys Asn Gin Tyr Tyr Trp Asp Lys Lys Asn Val Lys Leu Asp 
225 230 235 240 

Lys Val Asn Tyr Lys Val lie Lys Asp Leu Gin Ala Gly Ala Ser Leu 
245 250 255 

Tyr Asp Thr Glu Ser Val Asp Asp Ala Phe lie Thr Ala Asp Gin Val 

260 265 270 

Asn Lys Tyr Lys Asp Asn Lys Gly Leu Asn Phe Val Leu Thr Thr Gly 
275 280 285 

Thr Phe Phe Val Lys Met Asn Glu Lys Gin Tyr Pro Asp Phe Lys Asn 
290 295 300 

Lys Asn Leu Arg Leu Xaa Ser His Lys Gin 
305 310 

(2) INFORMATION FOR SEQ ID NO: 5195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 80 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5195: 

Met Lys Arg Leu He Gly Leu Val He Val Ala Leu Val Leu Leu Ala 
15 10 15 

Ala Cys Gly Gly Asn Asn Asp Lys Lys Val Thr He Gly Val Ala Ser 
20 25 30 

Asn Asp Thr Lys Ala Trp Glu Lys Val Lys Glu Leu Ala Lys Lys Asp 
35 40 45 

Asp He Asp Val Glu He Lys His Phe Ser Asp Tyr Asn Leu Pro Asn 
50 55 60 

Lys Ala Leu Asn Asp Gly Asp He Asp Met Asn Ala Phe Gin His Phe 
65 70 75 80 
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Ala Phe Leu Asp Gin Tyr Lys Lys Ala His Lys Gly Thr Lys lie Ser 
85 90 95 

Ala Leu Ser Thr Thr Val Leu Ala Pro Leu Gly He Tyr Ser Asp Lys 
100 105 110 

He Lys Asp Val Lys Lys Val Lys Asp Gly Ala Lys Val Val He Pro 
115 120 125 

Asn Asp Val Ser Asn Gin Ala Arg Ala Leu Lys Leu Leu Glu Ala Ala 
130 135 140 

Gly Leu He Lys Leu Lys Lys Asp Phe Gly Leu Ala Gly Thr Val Lys 
145 150 155 160 

Asp He Thr Ser Asn Pro Lys His Leu Lys He Thr Ala Val Asp Ala 
165 170 175 

Gin Gin Thr Ala Arg Ala Leu Ser Asp Val Asp He Ala Val He Asn 
180 185 190 

Asn Gly Val Ala Thr Lys Ala Gly Lys Asp Pro Lys Asn Asp Pro He 

195 200 205 

Phe Leu Glu Lys Ser Asn Ser Asp Ala Val Lys Pro Tyr He Asn He 
210 215 220 

Val Ala Val Asn Asp Lys Asp Leu Asp Asn Lys Thr Tyr Ala Lys He 
225 230 235 240 

Val Glu Leu Tyr His Ser Lys Glu Ala Gin Lys Ala Leu Gin Glu Asp 
245 250 255 

Val Lys Asp Gly Glu Lys Pro Val Asn Leu Ser Lys Asp Glu He Lys 
260 265 270 

Ala He Glu Thr Ser Leu Ala Lys 
275 280 

(2) INFORMATION FOR SEQ ID NO: 5196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5196: 

Met Lys Lys Leu Phe Gly Leu He Leu Val Leu Thr Phe Ala Val Val 
15 10 15 

ss Leu Ala Ala Cys Gly Asn Gly Asn Lys Ser Gly Ser Asp Asp Lys Lys 

20 25 30 
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lie Thr Val Ser Ala Ser Pro Ala Pro His Ala 
Tyr Glu 



40 



Ala Lys Pro 

50 

He Asn Asp 
65 

Asp Ala Asn 
Asp Lys Gly 



Gly Ala Thr 
130 



Lys He Glu 
He Lys Phe 



Tyr Thr 

Tyr Phe 
85 

Tyr Lys 
100 

Tyr Ser 1 
Val Tyr ' 
Phe Val , 



Glu Lys Lys Gly 
55 

Thr Pro Asn Lys 
70 

Gin His Thr Pro 



Leu Leu 
75 

Tyr Leu 
90 

Gly Asp 
Ser Leu 
Pro Ala 



Glu He Leu 
45 

Leu Asp He 
60 

Asp Lys Gly 
Asn Thr Glu 



Val His Leu 
110 



Glu Gin Gly 
140 



Glu Lys 

Lys Thr 

Glu He 
80 

Lys Lys 
95 

Glu Pro 
Pro Lys 
Arg Phe 



Asp Ala Lys Phe Ser Asp : 
165 



He Lys Lys < 
■ Glu Asn Lys : 



Gin Asn Glu i 
195 



Glu Gin Lys : 
210 



Asp Asp Lys ] 
He Gin Asp 1 
Lys 



i Lys Gin Ser Ala 
185 



. Asp Ala Val He 
200 



Glu Phe 
He Asn 
Ser He 



! Ala val 
235 



He Tyr 
Ala He 



Ser Asn Phe 
205 



Ala Val Glu Ser Ala 
220 



Lys Ala Leu He Glu Val 
250 



Lys Glu Gly 
Leu Gin Ser 



His Gin 
240 



Asn Glu Lys Tyr , 
265 



Ala Val He : 
270 



INFORMATION FOR SEQ ID NO: 5197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 97: 

Met Lys Lys He Lys Tyr He Leu Val Val Phe Val Leu Ser Leu Thr 
15 10 15 

Val Leu Ser Gly Cys Ser Leu Pro Gly Leu Gly Ser Lys Ser Thr Lys 
20 25 30 

Asn Asp Val Lys He Thr Ala Leu Ser Thr Ser Glu Ser Gin He He 
35 40 45 

Ser His Met Leu Arg Leu Leu He Glu His Asp Thr His Gly Lys He 
50 55 60 

Lys Pro Thr Leu Val Asn Asn Leu Gly Ser Ser Thr He Gin His Asn 
65 70 75 80 

Ala Leu He Asn Gly Asp Ala Asn He Ser Gly Val Arg Tyr Asn Gly 
85 90 95 

Thr Asp Leu Thr Gly Ala Leu Lys Glu Ala Pro He Lys Asn Pro Lys 
100 ' 105 110 

Lys Ala Met He Ala Thr Gin Gin Gly Phe Lys Lys Lys Phe Asp Gin 
115 120 125 

Thr Phe Phe Asp Ser Tyr Gly Phe Ala Asn Thr Tyr Ala Phe Met Val 
130 135 140 

Thr Lys Glu Thr Ala Lys Lys Tyr His Leu Glu Thr Val Ser Asp Leu 
145 150 155 160 

Ala Lys His Ser Lys Asp Leu Arg Leu Gly Met Asp Ser Ser Trp Met 
165 170 175 

Asn Arg Lys Gly Asp Gly Tyr Glu Gly Phe Lys Lys Glu Tyr Gly Phe 
180 185 190 

Asp Phe Gly Thr Val Arg Pro Met Gin lie Gly Leu Val Tyr Asp Ala 
195 200 205 

Leu Asn Ser Glu Lys Leu Asp Val Ala Leu Gly Tyr Ser Thr Asp Gly 
210 215 220 

Arg He Ala Ala Tyr Asp Leu Lys Val Leu Lys Asp Asp Lys Gin Phe 
225 230 235 240 

Phe Pro Pro Tyr Ala Ala Ser Ala Val Ala Thr Asn Glu Leu Leu Arg 
245 250 255 

Gin His Pro Glu Leu Lys Thr Thr He Asn Lys Leu Thr Gly Lys He 
260 265 270 

Ser Thr Ser Glu Met Gin Arg Leu Asn Tyr Glu Ala Asp Gly Lys Gly 
275 280 285 

Lys Glu Pro Ala Val Val Ala Glu Glu Phe Leu Lys Lys His His Tyr 
290 295 300 
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Phe Asp Lys Gin Lys Gly Gly His Lys 
305 310 

<2) INFORMATION FOR SEQ ID NO:5198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO -.5198: 



Met Lys Lys Leu Thr Thr Leu Leu Leu Ala Ser Thr Leu Leu lie Ala 
15 10 15 

Ala Cys Gly Asn Asp Asp Ser Lys Lys Asp Asp Ser Lys Thr Ser Lys 
20 25 30 

Lys Asp Asp Gly Val Lys Ala Glu Leu Lys Gin Ala Thr Lys Ala Tyr 
35 40 45 

Asp Lys Tyr Thr Asp Glu Gin Leu Asn Glu Phe Leu Lys Gly Thr Glu 
50 55 60 

Lys Phe Val Lys Ala He Glu Asn Asn Asp Met Ala Gin Ala Lys Ala 
65 70 75 80 

Leu Tyr Pro Lys Val Arg Met Tyr Tyr Glu Arg Ser Glu Pro Val Ala 
85 90 95 

Glu Ala Phe Gly Asp Leu Asp Pro Lys He Asp Ala Arg Leu Ala Asp 

100 105 110 

Met Lys Glu Glu Lys Lys Glu Lys Glu Trp Ser Gly Tyr His Lys He 
115 120 125 

Glu Lys Ala Leu Tyr Glu Asp Lys Lys He Asp Asp Val Thr Lys Lys 
130 135 140 

Asp Ala Gin Gin Leu Leu Lys Asp Ala Lys Glu Leu His Ala Lys Ala 
145 150 155 160 

Asp Thr Leu Asp He Thr Pro Lys Leu Met Leu Gin Gly Ser Val Asp 
165 170 175 

Leu Leu Asn Glu Val Ala Thr Ser Lys He Thr Gly Glu Glu Glu He 
180 185 190 

Tyr Ser His Thr Asp Leu Tyr Asp Phe Lys Ala Asn Val Glu Gly Ala 
195 200 205 

Gin Lys He Tyr Asp Leu Phe Lys Pro He Leu Glu Lys Lys Asp Lys 
210 215 220 
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Lys Leu Ser Asp Asp He Gin Met Asn Phe Asp Lys Val Asn Gin Leu 
225 230 235 240 

Leu Asp Lys Tyr Lys Asp Asn Asn Gly Gly Tyr Glu Ser Phe Glu Lys 
245 250 255 

Val Ser Lys Lys Asp Arg Lys Ala Phe Ala Asp Ala Val Asn Ala Leu 
260 265 270 

Gly Glu Pro Leu Ser Lys Met Ala Val He Thr Glu 
275 280 

(2) INFORMATION FOR SEQ ID NO: 5199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5199: 

Met Lys Lys Leu Thr Thr Leu Leu Leu Ala Ser Thr Leu Leu He Ala 
15 10 15 

Ala Cys Gly Asn Asp Asp Ser Lys Lys Asp Asp Ser Lys Thr Ser Lys 
20 25 30 

Lys Asp Asp Gly Val Lys Ala Glu Leu Lys Gin Ala Thr Lys Ala Tyr 
35 40 45 

Asp Lys Tyr Thr Asp Glu Gin Leu Asn Glu Phe Leu Lys Gly Thr Glu 
50 55 60 

Lys Phe Val Lys Ala He Glu Asn Asn Asp Met Ala Gin Ala Lys Ala 
65 70 75 80 

Leu Tyr Pro Lys Val Arg Met Tyr Tyr Glu Arg Ser Glu Pro Val Ala 
85 90 95 

Glu Ala Phe Gly Asp Leu Asp Pro Lys He Asp Ala Arg Leu Ala Asp 
100 105 110 

Met Lys Glu Glu Lys Lys Glu Lys Glu Trp Ser Gly Tyr His Lys He 
115 120 125 

Glu Lys Ala Leu Tyr Glu Asp Lys Lys He Asp Asp Val Thr Lys Lys 
130 135 140 

Asp Ala Gin Gin Leu Leu Lys Asp Ala Lys Glu Leu His Ala Lys Ala 
145 150 155 160 

Asp Thr Leu Asp He Thr Pro Lys Leu Met Leu Gin Gly Ser Val Asp 
165 170 175 
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Leu Leu Asn Glu Val Ala Thr Ser Lys He Thr Gly Glu Glu Glu He 

180 185 190 

Tyr Ser His Thr Asp Leu Tyr Asp Phe Lys Ala Asn Val Glu Gly Ala 

195 200 205 

Gin Lys He Tyr Asp Leu Phe Lys Pro He Leu Glu Lys Lys Asp Lys 

210 215 220 

Lys Leu Ser Asp Asp He Gin Met Asn Phe Asp Lys Val Asn Gin Leu 

225 230 235 240 

Leu Asp Lys Tyr Lys Asp Asn Asn Gly Gly Tyr Glu Ser Phe Glu Lys 

245 250 255 

Val Ser Lys Lys Asp Arg Lys Ala Phe Ala Asp Ala Val Asn Ala Leu 

260 265 270 



(2) INFORMATION FOR SEQ ID NO: 5200: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5200: 

Met Lys Phe Lys Ala He Val Ala He Thr Leu Ser Leu Ser Leu Leu 
15 10 15 

Thr Ala Cys Gly Ala Asn Gin His Lys Glu Asn Ser Ser Lys Ser Asn 
20 25 30 

Asp Thr Asn Lys Lys Thr Gin Gin Thr Asp Asn Thr Thr Gin Ser Asn 
35 40 45 

Thr Glu Lys Gin Met Thr Pro Gin Glu Ala Glu Asp He Val Arg Asn 
50 55 60 

Asp Tyr Lys Ala Arg Gly Val Asn Glu Tyr Gin Thr Leu Asn Tyr Lys 
65 70 75 80 

Thr Asn Leu Glu Arg Ser Asn Glu His Glu Tyr Tyr Val Glu His Leu 
85 90 95 

Val Arg Asp Ala Val Gly Thr Pro Leu Lys Arg Cys Ala He Val Asn 
100 105 110 

Arg His Asn Gly Thr He He Asn He Phe Asp Asp Met Ser Glu Lys 
115 120 125 
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Asp Lys Glu Glu Phe Glu Ala Phe Lys Lys Arg Ser Pro Lys Tyr Asn 
130 135 * 140 

Pro Gly Met Asn Asn His Asp Glu Thr Asp Gly Glu Ser Glu Asp lie 
145 150 155 ISO 

Gin His His Asp lie Asp Asn Asn Lys Ala He Gin Asn Asp He Pro 
165 170 175 

Asp Gin Lys Val Asp Asp Lys Asn Asp Lys Asn Ala Val Asn Lys Glu 
180 185 190 

Glu Lys His Asp Asn Gly Ala Asn Asn Ser Glu Glu Thr Lys Val Lys 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 5201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID N0:5201: 

Met Leu Lys Gly Cys Gly Gly Cys Leu He Ser Phe He Leu Leu He 
15 10 15 

He Leu Leu Ser Ala Cys Ser Met Met Phe Ser Asn Asn Asp Asn Ser 
20 25 30 

Thr Asn Asn Gin Ser Ser Lys Thr Gin Leu Thr Gin Lys Asp Glu Asn 
35 40 45 

Lys Asn Glu Asp Lys Pro Glu Glu Lys Ser Glu Thr Ala Thr Asp Glu 
50 55 60 

Asp Leu Gin Ser Thr Glu Glu Val Pro Ala Asn Glu Asn Thr Glu Asn 

65 70 75 80 

Asn Gin His Glu He Asp Glu He Thr Thr Lys Asp Gin Ser Asp Asp 
85 90 95 

Asp He Asn Thr Pro Asn Val Ala Glu Asp Lys Ser Gin Asp Asp Leu 
100 105 110 

Lys Asp Asp Leu Lys Glu Lys Gin Gin Ser Ser Asn His His Gin Ser 
115 120 125 

Thr Gin Pro Lys Thr Ser Pro Ser Thr Glu Thr Asn Thr Gin Gin Ser 
130 135 140 

Phe Ala Asn Cys Lys Gin Leu Arg Gin Val Tyr Pro Asn Gly Val Thr 
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Ala Asp His Pro Ala Tyr Arg Pro His Leu Asp Arg Asp Lys Asp Lys 
165 170 175 

Arg Ala Cys Glu Pro Asp Lys Tyr 
180 

(2) INFORMATION FOR SEQ ID NO: 5202: 

(i) SEQUENCE CHARACTERISTICS: 
'o (A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

, 5 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5202: 

Met Lys Lys Arg Leu Leu Leu Ser Thr Phe Leu Ala Ser Thr Leu lie 
15 10 15 

Leu Thr Gly Cys Ala Ser Asp Gin Ser Asp Asn Glu Asp His His Thr 
20 25 30 

Ser Thr Gly lie His Ala Pro Lys Ser Ala Lys Lys Leu Glu Thr Lys 
35 40 45 

Asp He Phe Xaa Ser Asp Lys Lys Asn Ser Asp He Ser Asp Ala Glu 
50 55 60 

Met Lys Gin Ala He Glu Lys Tyr Leu Ser Val Asn Ser Asp He Leu 
65 70 75 60 

Asp Asn Lys Tyr He Met Gin His Lys Leu Asp Lys Gin He Asp Ser 
85 90 95 

Gin Thr Lys Val Thr Glu Lys Gin Ala Glu Thr Leu Ser His Leu Ser 
100 105 110 

Asn Leu Ala Val Lys Asn Asp Leu His Phe Lys Lys Phe Val Thr Glu 
115 120 125 

Asn Asn He Pro Lys Glu Tyr Lys Lys Pro Val Glu Leu Met Met Asn 
130 135 140 

Tyr Phe Lys Ala Leu Asn Ser Thr He Ala Asn Val Asp Glu Asp He 
145 150 155 160 

Glu Lys Leu Ser Tyr Gin Pro Gin Asn Lys He Asn Val Val Asp Val 
165 170 175 

Pro Thr Lys Tyr Ala Gly Asp Val Asn Lys Lys Gin Gin Asp Lys He 
180 185 190 

Lys Asp Phe Leu Lys Ser Lys Gly He Lys Ser Asp Val He Asp Lys 
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(2) INFORMATION FOR SEQ ID NO: 52 03: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5203: 

Met Lys Ser Lys lie Tyr lie Leu Leu Leu Xaa Leu lie Phe Leu Ser 
15 10 15 

Ala Cys Ala Asn Thr Arg His Ser Glu Ser Asp Lys Asn Val Leu Thr 
20 25 30 

Val Tyr Ser Pro Tyr Gin Ser Asn Leu lie Arg Pro lie Leu Asn Glu 
35 40 45 

Xaa Glu Lys Gin Glu His Val Lys lie Glu lie Lys His Gly Ser Thr 
50 55 60 

Gin Val Leu Leu Ser Asn Leu His Asn Glu Asp Phe Ser Glu Arg Gly 
65 70 75 80 

Asp Val Phe Met Gly Gly Val Leu Ser Glu Thr lie Asp His Pro Glu 
85 90 * 95 

Asp Phe Val Pro Tyr Gin Asp Thr Ser Val Thr Gin Gin Leu Glu Asp 
100 105 110 

Tyr Arg Ser Asn Asn Lys Tyr Val Thr Ser Phe Leu Leu Met Pro Thr 
115 120 125 

Val He val Val Asn Ser Asp Leu Gin Gly Asp He Lys He Arg Gly 
130 135 140 

Tyr Gin Asp Leu Leu Gin Pro He Leu Lys Gly Lys He Ala Tyr Ser 
145 150 155 160 

Asn Pro Asn Thr Thr Thr Thr Gly Tyr Gin His Met Arg Ala He Tyr 
165 170 " 175 

Ser Met His His Arg Val Ser Asp Val His Gin Phe Gin Asn His Ala 
180 185 190 

Met Gin Leu Ser Lys Thr Ser Lys Val He Glu Asp Val Ala Lys Gly 
195 200 205 

Lys Tyr Tyr Ala Gly Leu Ser Tyr Glu Gin Asp Ala Arg Thr Trp Lys 
210 215 220 



3200 



EP 0 786 519 A2 



Leu Asn Val Asp Gly He Ala Leu Val Lys Asn Ala His Pro His Pro 
245 250 255 

Lys Arg Lys Lys Leu Val Gin Tyr Leu Thr Ser Arg Ser Val Gin Gin 
260 265 " 270 

Arg Leu Val Ala Glu Phe Asp Ala Lys Ser He 
275 280 

(2) INFORMATION FOR SEQ ID NO:5204: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 09 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5204: 

Met Lys Lys Phe He Gly Ser Val Leu Ala Thr Thr Leu He Leu Gly 
15 10 15 

Gly Cys Ser Thr Met Glu Asn Glu Ser Lys Lys Asp Thr Lys Thr Glu 
20 25 30 

Thr Lys Ser Val Pro Glu Glu Met Glu Ala Ser Lys Tyr Val Gly Gin 
35 40 45 

Gly Phe Gin Pro Pro Ala Glu Lys Asn Ala He Glu Phe Ala Lys Lys 
50 55 60 

His Arg Lys Glu Phe Glu Lys Val Gly Glu Gin Phe Phe Lys Asp Asn 
65 70 75 ' 80 

Phe Gly Leu Lys Val Lys Ala Thr Asn Val Val Gly Lys Asp Asp Gly 
85 90 95 

Val Glu Val Tyr Val His Cys Glu Asp His Gly He Val Phe Asn Ala 
100 105 110 

Ser Leu Pro Leu Tyr Lys Asp Ala He His Gin Lys Gly Ser Met Arg 
115 120 125 

Ser Asn Asp Asn Gly Asp Asp Met Ser Met Met Val Gly Thr Val Leu 
130 135 140 

Ser Gly Phe Glu Tyr Arg Ala Gin Lys Glu Lys Tyr Asp Asn Leu Tyr 
145 150 155 160 

Lys Phe Phe Lys Glu Asn Glu Lys Lys Tyr Gin Tyr Thr Gly Phe Thr 
165 170 175 
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Phe Tyr lie Thr Tyr Ser Ser Arg Ser Leu Lys Glu Tyr Arg Lys Tyr 
195 200 205 

Tyr Glu Pro Leu lie Arg Lys Asn Asp Lys Glu Phe Lys Glu Gly Met 
210 215 220 

Glu Arg Ala Arg Lys Glu Val Asn Tyr Ala Ala Asn Thr Asp Ala Val 
225 230 235 240 

Ala Thr Leu Phe Ser Thr Lys Lys Asn Phe Thr Lys Asp Asn Thr Val 
245 250 255 

Asp Asp Val lie Glu Leu Ser Asp Lys Leu Tyr Asn Leu Lys Asn Lys 
260 265 270 

Pro Asp Lys Ser Thr lie Thr lie Gin lie Gly Lys Pro Thr He Asn 
275 280 285 

Thr Lys Lys Ala Phe Tyr Asp Asp Asn Arg Pro He Glu Tyr Gly Val 
290 295 300 

His Ser Lys Asp Glu 
305 

(2) INFORMATION FOR SEQ ID NO: 5205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5205: 

Met Lys Lys Leu Val Ser He Val Gly Ala Thr Leu Leu Leu Ala Gly 
15 10 15 

Cys Gly Ser Gin Asn Leu Ala Pro Leu Glu Glu Lys Thr Thr Asp Leu 
20 25 30 

Arg Glu Asp Asn His Gin Leu Lys Leu Asp He Gin Glu Leu Asn Gin 
35 40 45 

Gin He Ser Asp Ser Lys Ser Lys He Lys Gly Leu Glu Lys Asp Lys 
50 55 60 

Glu Asn Ser Lys Lys Thr Ala Ser Asn Asn Thr Lys He Lys Leu Met 
65 70 75 80 

Asn Val Thr Ser Thr Tyr Tyr Asp Lys Val Ala Lys Ala Leu Lys Ser 
85 90 95 
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Val Gin Ser Lys Leu Asn Gin lie Ser Asn Asp lie Gin Ser Ala His 

115 120 125 

Thr Ser Tyr Lys Asp Ala lie Asp Gly Leu Ser Leu Ser Asp Asp Asp 

130 135 140 

Lys Lys Thr Ser Lys Asn lie Asp Lys Leu Asn Ser Asp Leu Asn His 

145 150 155 160 

Ala Phe Asp Asp lie Lys Asn Gly Tyr Gin Asn Lys Asp Lys Lys Gin 

165 170 * * 175 

Leu Thr Lys Gly Gin Gin Ala Leu Ser Lys Leu Asn Leu Asn Ala Lys 

180 185 190 



INFORMATION FOR SEQ ID NO: 5206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5206: 

Met Lys Arg Leu Leu Phe Val Met lie Ala Phe Val Phe lie Leu Ala 
15 10 15 

Ala Cys Gly Asn Asn Ser Ser Lys Asp Lys Glu Ala Ser Lys Asp Ser 
20 25 30 

Lys Thr He Asn Val Gly Thr Glu Gly Thr Tyr Ala Pro Phe Ser Phe 
35 40 45 

His Asp Lys Asp Gly Lys Leu Thr Gly Tyr Asp He Asp Val He Lys 
50 55 60 

Ala Val Ala Lys Glu Glu Gly Leu Lys Leu Lys Phe Asn Glu Thr Ser 
65 70 75 80 

Trp Asp Ser Met Phe Ala Gly Leu Asp Ala Gly Arg Phe Asp Val He 
85 90 95 

Ala Asn Gin Val Gly He Asn Pro Asp Arg Glu Lys Lys Tyr Lys Phe 
100 105 no 

Ser Lys Pro Tyr Thr Phe Ser Ser Ala Val Leu Val He Arg Glu Asn 
115 120 125 
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Gin Thr Phe Thr Ser Asn Tyr Gly Lys Leu Ala Lys Asp Lys Gly Ala 

145 150 155 160 

Asp He Thr Lys Val Asp Gly Phe Asn Gin Ser Met Asp Leu Leu Leu 

165 170 " 175 

Ser Lys Arg Val Asp Gly Thr Phe Asn Asp Ser Leu Ser Tyr Leu Asp 

180 185 190 

Tyr Lys Lys Gin Lys Pro Asn Ala Lys He Lys Ala He Lys Gly Asn 

195 200 205 

Ala Glu Gin Ser Arg Ser Ala Phe Ala Phe Ser Lys Lys Ala Asp Asp 

210 215 220 

Glu Thr Val Gin Lys Phe Asn Asp Gly Leu Lys Lys He Glu Glu Asn 

225 230 235 240 

Gly Glu Leu Ala Lys He Gly Lys Lys Trp Phe Gly Gin Asp Val Ser 



(2) INFORMATION FOR SEQ ID NO: 5207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 203 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5207: 

Met Gly Val His Ser Met Lys Leu Lys Arg Leu Phe Ala Val Val He 

15 10 15 

Ala Met Leu Leu Val Leu Ala Gly Cys Ser Asn Ser Asn Asp Asn Asn 
20 25 30 

Glu Ser Lys Lys Asp Asp Ala Asp Asn Gly Lys Lys Gin Glu He Gin 
35 40 45 

Val Ala Ala Ala Ala Ser Leu Thr Asp Val Thr Lys Lys Leu Ala Ser 
50 55 * 60 

Glu Phe Lys Lys Glu His Lys Asn Ala Asp He Lys Phe Asn Tyr Gly 
65 70 75 80 

Gly Ser Gly Ala Leu Arg Lys Gin He Glu Ser Gly Ala Pro Val Asp 

85 90 95 
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Asn Lys Ala His Asp Thr Tyr Lys Tyr Ala Lys Asn Ser Leu Val Leu 
115 120 " 125 

lie Gly Asp Lys Asp Ser Asn Tyr Thr Ser val Lys Asp Leu Lys Asp 
130 135 140 

Asn Asp Lys Leu Ala Leu Gly Glu Val Lys Thr Val Pro Ala Gly Lys 
145 150 155 160 

Tyr Ala Lys Gin Tyr Leu Asp Asn Asn Asn Leu Phe Lys Glu Val Glu 
165 170 175 

Ser Xaa He Val Tyr Ala Lys Asp Val Lys Gin Val Leu Asn Tyr Val 
180 185 190 



(2) INFORMATION FOR SEQ ID NO: 5208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5208: 

Met Lys Lys Trp Gin Phe Val Gly Thr Thr Ala Leu Gly Ala Thr Leu 
15 10 15 

Leu Leu Gly Ala Cys Gly Gly Gly Asn Gly Gly Ser Gly Asn Ser Asp 
20 25 30 

Leu Lys Gly Glu Ala Lys Gly Asp Gly Ser Ser Thr Val Ala Pro He 
35 40 45 

Val Glu Lys Leu Asn Glu Lys Trp Ala Gin Asp His Ser Asp Ala Lys 
50 55 60 

He Ser Ala Gly Gin Ala Gly Thr Gly Ala Gly Phe Gin Lys Phe He 
65 70 75 80 

Ala Gly Asp He Asp Phe Ala Asp Ala Ser Arg Pro He Lys Asp Glu 
85 90 95 

Glu Lys Gin Lys Leu Gin Asp Lys Asn He Lys Tyr Lys Glu Phe Lys 
100 105 " 110 

He Ala Gin Asp Gly Val Thr Val Ala Val Asn Lys Glu Asn Asp Phe 
115 120 125 



EP 0 786 519 A2 



Ala Lys Thr Trp Lys Asp Val Asn Ser Lys Trp Pro Asp Lys Lys lie 

145 150 155 * 160 

Asn Ala Val Ser Pro Asn Ser Ser His Gly Thr Tyr Asp Phe Phe Glu 

165 170 ' ' 175 

Asn Glu Val Met Asn Lys Glu Asp lie Lys Ala Glu Lys Asn Ala Asp 

180 185 190 

Thr Asn Ala He Val Ser Ser Val Thr Lys Asn Lys Glu Gly He Gly 

195 200 " 205 

Tyr Phe Gly Tyr Asn Phe Tyr Val Gin Asn Lys Asp Lys Leu Lys Glu 

210 215 220 

Val Lys He Lys Asp Glu Asn Gly Lys Ala Thr Glu Pro Thr Lys Lys 

225 230 235 240 

Thr He Gin Asp Asn Ser Tyr Ala Leu Ser Arg Pro Leu Phe He Tyr 

245 250 255 

Val Asn Glu Lys Ala Leu Lys Asp Asn Lys Val Met Ser Glu Phe He 

260 265 270 

Lys Phe Val Leu Glu Asp Lys Gly Lys Ala Ala Glu Glu Ala Gly Tyr 

275 280 285 

Val Ala Ala Pro Glu Lys Thr Tyr Lys Ser Gin Leu Asp Asp Leu Lys 

290 295 300 

Ala Phe He Asp Lys Asn Gin Lys Ser Asp Asp Lys Lys Ser Asp Asp 

305 310 315 320 



(2) INFORMATION FOR SEQ ID NO: 5209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 24 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:5209: 

Met Lys Arg Leu Ser He He Val He He Gly He Phe He He Thr 
15 10 15 

Gly Cys Asp Trp Gin Arg Thr Ser Lys Glu Arg Ser Lys Asn Ala Gin 
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35 40 45 

Asn Leu Met Met Thr Lys Lys Leu Leu Ser Gin Tyr Asn His Pro Lys 
50 55 60 

Tyr Lys Leu Glu Leu Val Lys Phe Asn Asn Trp Pro Asp Leu Met Asp 
65 70 75 80 

Ala Leu Asn Ser Gly Arg lie Asp Gly Ala Ser Thr Leu lie Glu Leu 
85 90 95 

Ala Met Lys Ser Lys Gin Lys Gly Ser Asn He Lys Ala Val Ala Leu 
100 105 110 

Gly His His Glu Gly Asn Val He Met Gly Gin Lys Gly Met His Leu 
115 " 120 125 

Asn Glu Phe Asn Asn Asn Gly Asp Asp Tyr His Phe Gly He Pro His 
130 135 140 

Arg Tyr Ser Thr His Tyr Leu Leu Leu Glu Glu Leu Arg Lys Gin Leu 
145 150 155 160 

Lys lie Lys Pro Gly His Phe Ser Tyr His Glu Met Ser Pro Ala Glu 
165 170 175 

Met Pro Ala Ala Leu Ser Glu His Arg He Thr Gly Tyr Ser Val Ala 
180 185 190 

Glu Pro Phe Gly Ala Leu Gly Glu Lys Leu Gly Lys Gly Lys Thr Leu 
195 200 205 

Lys His Gly Asp Asp Val He Pro Asp Ala Tyr Cys Cys Val Leu Val 
210 215 220 

Leu Arg Gly Glu Leu Leu Asp Gin His Lys Asp Val Ala Gin Ala Phe 
225 230 235 240 

Val Gin Asp Tyr Lys Lys Ser Gly Phe Lys Met Asn Asp Arg Lys Gin 
245 250 255 

Ser Val Asp He Met Thr His His Phe Lys Gin Ser Arg Asp Val Leu 
260 265 270 

Thr Gin Ser Ala Ala Trp Thr Ser Tyr Gly Asp Leu Thr He Lys Pro 
275 280 285 

Ser Gly Tyr Gin Glu He Thr Thr Leu Val Lys Gin His His Leu Phe 
290 295 300 

Asn Pro Pro Ala Tyr Asp Asp Phe Val Glu Pro Ser Leu Tyr Lys Glu 

305 310 315 320 

Ala Ser Arg Ser 

INFORMATION FOR SEQ ID NO: 5210: 
(i) SEQUENCE CHARACTERISTICS: 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-5210: 

Met Lys Lys Thr Leu Gly Cys Leu Leu Leu lie Met Leu Leu Val Val 
15 10 15 

Ala Gly Cys Ser Phe Gly Gly Asn His Lys Leu Ser Ser Lys Lys Ser 
20 25 30 

Glu Glu Ser Lys Gin Glu Thr Val Lys Lys Glu Ser Glu Glu Glu Lys 
35 40 45 

Asp Pro Asp Leu Glu Lys Tyr Glu Glu He Glu Lys Lys Met Lys Gly 
50 55 60 

He Lys Asp Ala Pro Ser Leu Asp Lys Leu Asp Pro Leu Met Thr Glu 
65 70 75 80 

Lys Ser Phe Thr Asn Ser Lys Gly He Gin Gly Trp Lys Asp Tyr Lys 
85 90 95 

Glu Leu Met Gly Lys Val Glu Leu Ala Asp Tyr Arg Phe Thr Lys Asp 
100 105 110 

Ser Lys Gly Ser Ser He Lys Asp Val Asp Ala Phe Phe Lys Gly Lys 
115 120 125 

Lys Gly He Lys Arg Lys Val He Glu Thr His Asp Asp Val Lys Gin 
130 135 140 

Val Asp Tyr Trp 
145 

INFORMATION FOR SEQ ID NO: 5211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5211: 

Trp Pro Cys Ala Thr Xaa Gin Glx Glu Trp Trp Ser Arg His Xaa Trp 
15 10 15 
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20 25 30 

His 



(2) INFORMATION FOR SEQ ID NO: 5212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 amino acids 
w (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:5212: 

Met Ser He He Met Glu Val Ala Thr Met Gin Ala Lys Leu Thr Lys 
15 10 15 

Asn Glu Phe He Glu Trp Leu Lys Thr Ser Glu Gly Lys Gin Phe Asn 
20 25 30 

Val Asp Leu Trp Tyr Gly Phe Gin Cys Phe Asp Tyr Ala Asn Ala Gly 
35 40 45 

Trp Lys Val Leu Phe Gly Leu Leu Leu Lys Gly Leu Gly Ala Lys Asp 
50 55 60 

He Pro Phe Ala Asn Asn Phe Asp Gly Leu Ala Thr Val Tyr Gin Asn 
65 70 75 80 

Thr Pro Asp Phe Leu Ala Gin Pro Gly Asp Met Val Val Phe Gly Ser 



Asn Tyr Gly Ala Gly Tyr Gly His Val Ala Trp Val He Glu Ala Thr 
100 105 110 

Leu Asp Tyr He He Val Tyr Glu Gin Asn Trp Leu Gly Gly Gly Trp 
115 120 125 

Thr Asp Gly He Glu Gin Pro Gly Trp Gly Trp Glu Lys Val Thr Arg 
130 135 140 

Arg Gin His Ala Tyr Asp Phe Pro Met Trp Phe He Arg Pro Asn Phe 
145 150 155 160 

Lys Ser Glu Thr Ala Pro Arg Ser Val Gin Ser Pro Thr Gin Ala Pro 
165 170 175 



He Lys Asp Val Val Lys Gly Tyr Asp Leu Pro Lys Arg Gly Ser Asn 
195 200 205 
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Ala Glu Ala Tyr Arg Asn Gly Leu Val Asn Ala Pro Leu Ser Arg Leu 
225 230 235 240 

Glu Ala Gly He Ala His Ser Tyr Val Ser Gly Asn Thr Val Trp Gin 
245 250 255 

Ala Leu Asp Glu Ser Gin Val Gly Trp His Thr Ala Asn Gin He Gly 
260 265 270 

Asn Lys Tyr Tyr Tyr Gly He Glu Val Cys Gin Ser Met Gly Ala Asp 
275 280 285 

Asn Ala Thr Phe Leu Lys Asn Glu Gin Ala Thr Phe Gin Glu Cys Ala 
290 295 300 

Arg Leu Leu Lys Lys Trp Gly Leu Pro Ala Asn Arg Asn Thr He Arg 

305 310 315 320 

Leu His Asn Glu Phe Thr Ser Thr Ser Cys Pro His Arg Ser Ser Val 
325 330 ' 335 

Leu His Thr Gly Phe Asp Pro Val Thr Arg Gly Leu Leu Pro Glu Asp 
340 345 ' 350 

Lys Arg Leu Gin Leu Lys Asp Tyr Phe He Lys Gin He Arg Ala Tyr 
355 360 365 

Met Asp Gly Lys He Pro Val Ala Thr Val Ser Asn Glu Ser Ser Ala 
370 375 380 

Ser Ser Asn Thr Val Lys Pro Val Ala Ser Ala Trp Lys Arg Asn Lys 
385 390 395 400 

Tyr Gly Thr Tyr Tyr Met Glu Glu Ser Ala Arg Phe Thr Asn Gly Asn 
405 410 415 

Gin Pro He Thr Val Arg Lys Val Gly Pro Phe Leu Ser Cys Pro Val 
420 425 430 

Gly Tyr Gin Phe Gin Pro Gly Gly Tyr Cys Asp Tyr Thr Glu Val Met 
435 440 445 

Leu Gin Asp Gly His Val Trp Val Gly Tyr Thr Trp Glu Gly Gin Arg 
450 455 460 

Tyr Tyr Leu Pro He Arg Thr Trp Asn Gly Ser Ala Pro Pro Asn Gin 
465 470 475 480 

He Leu Gly Asp Leu Trp Gly Glu He Ser 
485 * 490 

(2) INFORMATION FOR SEQ ID NO: 5213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5213 : 

Gly Asp Lys Met Asn Lys lie Ser Lys Tyr He Ala He Ala Ser Leu 
15 10 15 

Ser Val Ala Val Thr Val Ser Ala Pro Gin Thr Thr Asn Ser Thr Ala 
20 25 30 

Phe Ala Lys Ser Ser Ala Glu Val Gin Gin Thr Gin Gin Ala Ser He 
35 40 45 

Pro Ala Ser Gin Lys Ala Asn Leu Gly Asn Gin Asn He Met Ala Val 
50 55 60 

Ala Trp Tyr Gin Asn Ser Ala Glu Ala Lys Ala Leu Tyr Leu Gin Gly 
65 70 75 80 

Tyr Asn Ser Ala Lys Thr Gin Leu Asp Lys Glu He Lys Lys Asn Lys 
85 90 " 95 

Gly Lys His Lys Leu Ala He Ala Leu Asp Leu Asp Glu Thr Val Leu 
100 105 no 

Asp Asn Ser Pro Tyr Gin Gly Tyr Ala Ser lie His Asn Lys Pro Phe 
115 120 125 

Pro Glu Gly Trp His Glu Trp val Gin Ala Ala Lys Ala Lys Pro Val 
130 135 140 

Tyr Gly Ala Lys Glu Phe Leu Lys Tyr Ala Asp Lys Lys Gly Val Asp 
145 150 155 160 

He Tyr Tyr He Ser Asp Arg Asp Lys Glu Lys Asp Leu Lys Ala Thr 
165 170 175 

Gin Lys Asn Leu Lys Gin Gin Gly He Pro Gin Ala Lys Lys Ser His 
180 185 190 

He Leu Leu Lys Gly Lys Asp Asp Lys Ser Lys Glu Ser Arg Arg Gin 
195 200 205 

Met Val Gin Lys Asp His Lys Leu Val Met Leu Phe Gly Asp Asn Leu 
210 215 220 

Leu Asp Phe Thr Asp Pro Lys Glu Ala Thr Ala Glu Ser Arg Glu Ala 
225 230 235 240 

Leu He Glu Lys His Lys Asp Asp Phe Gly Lys Lys Tyr He He Phe 
245 250 255 

Pro Asn Pro Met Tyr Gly Ser Trp Glu Ala Thr He Tyr Asn Asn Asn 
260 265 270 
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Lys Gin Phe Asp Pro Lys Thr Gly Glu Val Lys 
290 295 

(2) INFORMATION FOR SEQ ID NO: 5214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5214: 



Leu Asn Lys Cys Lys lie lie lie Trp Arg lie lie Asn Met Lys Asn 
15 10 15 

Lys Leu He Ala Lys Ser Leu Leu Thr Leu Ala Ala He Gly He Thr 
20 25 30 

Thr Thr Thr He Ala Ser Thr Ala Asp Ala Ser Glu Gly Tyr Gly Pro 
35 40 45 

Arg Glu Lys Lys Pro Val Ser He Asn His Asn He Val Glu Tyr Asn 
50 55 60 

Asp Gly Thr Phe Lys Tyr Gin Ser Arg Pro Lys Phe Asn Ser Thr Pro 

65 70 75 80 

Lys Tyr He Lys Phe Lys His Asp Tyr Asn He Leu Glu Phe Asn Asp 
85 90 95 

Gly Thr Phe Glu Tyr Gly Ala Arg Pro Gin Phe Asn Lys Pro Ala Ala 
100 105 110 

Lys Thr Asp Ala Thr He Lys Lys Glu Gin Lys Leu He Gin Ala Gin 
115 120 * 125 

Asn Leu Val Arg Glu Phe Glu Lys Thr His Thr Val Ser Ala His Arg 
130 135 140 

Lys Ala Gin Lys Ala Val Asn Leu Val Ser Phe Glu Tyr Lys Val Lys 
145 150 155 " 160 

Lys Met Val Leu Gin Glu Arg He Asp Asn Val Leu Lys Gin Gly Leu 
165 170 175 



(2) INFORMATION FOR SEQ ID NO: 5215: 
(i) SEQUENCE CHARACTERISTICS: 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5215: 

Lys Glu Arg Val Leu Met Lys Lys Leu Leu Thr Ala Ser He He Ala 
15 10 15 

Cys Ser Val Val Met Gly Val Gly Leu Val Asn Thr Ser Ala Glu Ala 
20 25 30 

Ala Ser Gly Asn Ser He Asp Thr Val Lys Gin Leu He Lys Gly Asp 
35 40 45 

Gin Ser Leu Glu Asn Val Lys He Gly Glu Ser He Lys Asp Val Leu 
50 55 60 

Thr Lys Tyr Lys Asn Pro Met Tyr Ser Tyr Asn Glu Asp Gly Thr Glu 
65 70 75 " 80 

His Tyr Tyr Glu Phe His Thr Lys Lys Gly Met Leu Leu Val Thr Thr 
85 90 95 

Asp Gly Lys Lys Asn Asn Gly Lys Val Thr His He Ser Met Met Tyr 
100 105 HO 

Asn Asp Ala Asn Gly Pro Thr Tyr Gin Ala Val Lys Asn Tyr Val Gly 

115 120 125 

Lys Ala Val Thr His Thr Glu Tyr Ser Lys Val Ala Gly Asn Phe Gly 
130 135 140 

Tyr lie Glu Lys Gly Lys Thr Thr Tyr Gin Phe Ala Ser Ala Pro Lys 
145 150 155 160 

Asp Lys Asn He Lys Leu Tyr Arg He Asp Leu Glu Lys 
165 170 

(2) INFORMATION FOR SEQ ID NO: 5216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5216: 
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15 10 15 

Asn Glu Asp Gly Ser Lys Lys Lys Met Ser Thr Thr Ala Lys Val Val 
20 25 30 

Ser lie Ala Thr Val Leu Leu Leu Leu Gly Gly Leu Val Phe Ala lie 
35 40 45 

Phe Ala Tyr Val Asp His Ser Asn Lys Ala Lys Glu Arg Met Leu Asn 
50 55 60 

Glu Gin Lys Gin Glu Gin Lys Glu Lys Arg Gin Lys Glu Asn Ala Glu 
65 70 75 80 

Lys Glu Arg Lys Lys Lys Gin Gin Glu Glu Lys Glu Gin Asn Glu Leu 
85 90 95 

Asp Ser Gin Ala Asn Gin Tyr Gin Gin Leu Pro Gin Gin Asn Gin Tyr 
100 105 110 

Gin Tyr Val Pro Pro Gin Gin Gin Ala Pro Thr Lys Gin Arg Pro Ala 
115 120 125 

Lys Glu Glu Asn Asp Asp Lys Ala Ser Lys Asp Glu Ser Lys Asp Lys 
130 135 140 

Asp Asp Lys Ala Ser Gin Asp Lys Ser Asp Asp Asn Gin Lys Lys Thr 
145 150 155 160 

Asp Asp Asn Lys Gin Pro Ala 
165 

(2) INFORMATION FOR SEQ ID NO: 5217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5217: 

45 

Met Lys Arg Asn Phe Pro Lys Leu He Ala Leu Ser Leu He Phe Ser 
15 10 15 

Leu Ser Val Thr Pro He Ala Asn Ala Glu Ser Asn Ser Asn He Lys 
20 25 30 

Ala Lys Asp Lys Lys His Val Gin Val Asn Val Glu Asp Lys Ser Val 
35 40 45 

Pro Thr Asp Val Arg Asn Leu Ala Gin Lys Asp Tyr Leu Ser Tyr Val 
55 50 55 60 
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Gly Glu Pro Phe Lys lie Tyr Lys Phe Asn Lys Lys Ser Asp Gly Asn 
85 90 95 

Tyr Tyr Phe Pro Val Leu Asn Thr Glu Gly Asn lie Asp Tyr lie Val 
100 105 110 

Thr He Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 5218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5218: 

Asn Phe Lys Met Gin Glu Val Lys Tyr Met Thr Glu He Thr Phe Lys 
15 10 15 

Gly Gly Pro He His Leu Lys Gly Gin Gin He Asn Glu Gly Asp Phe 
20 25 30 

Ala Pro Asp Phe Thr Val Leu Asp Asn Asp Leu Asn Gin Val Thr Leu 
35 40 45 

Ala Asp Tyr Ala Gly Lys Lys Lys Leu He Ser Val Val Pro Ser He 
50 55 60 

Asp Thr Gly Val Cys Asp Gin Gin Thr Arg Lys Phe Asn Ser Asp Ala 
65 70 75 80 

Ser Lys Glu Glu Gly He Val Leu Thr He Ser Ala Asp Leu Pro Phe 
85 90 95 

Ala Gin Lys Arg Trp Cys Ala Ser Ala Gly Leu Asp Asn Val He Thr 
100 105 110 

Leu Ser Asp His Arg Asp Leu Ser Phe Gly Glu Asn Tyr Gly Val Val 
115 120 125 

Met Glu Glu Leu Arg Leu Leu Ala Arg Ala Val Phe Val Leu Asp Ala 
130 135 140 

Asp Asn Lys Val Val Tyr Lys Glu He Val Ser Glu Gly Thr Asp Phe 
145 150 155 160 

Pro Asp Phe Asp Ala Ala Leu Ala Ala Tyr Lys Asn He 
165 170 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 9 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 19: 

lie Glu Ser Arg Phe lie Met Ala Lys lie Asn Phe Asp Ala Ala Thr 
1 5 10 15 

Lys Gly Asn Pro Gly He Ser Thr Cys Ala He Val He Lys Glu Asp 
20 25 30 

Glu Gin His Tyr Thr Tyr Thr His Glu Leu Gly Glu Met Asp Asn His 
35 40 45 

Thr Ala Glu Trp Ala Ala Cys He Tyr Ala Leu Glu His Ala Arg Glu 
50 55 60 

Leu Asn Val Gin Asn Ala Leu Leu Tyr Thr Asp Ser Lys Leu He Ala 
65 70 75 80 

Asp Ser He Glu Ala Gly Tyr Val Lys Asn Ala Asn Phe Lys Pro Tyr 
85 90 95 

Phe Asp Gin He Glu He Phe Glu Lys Asp Phe Asp Leu Leu Phe Val 
100 105 110 

Lys Trp lie Pro Arg Glu Gin Asn Lys Glu Ala Asn Gin His Ala Gin 
115 120 125 

Gin Ala Leu Tyr Lys Leu He Lys Lys Asn Lys 
130 135 

INFORMATION FOR SEQ ID NO: 522 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5220: 

Met Pro Gly Thr Val Leu Asp Pro Gin Met He Lys Asn Glu Asp Val 
1 5 10 15 
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20 25 30 

Gly Val Asn Thr Ser Met Asp Trp Asp Arg Lys Tyr Pro Tyr Gly Asp 
35 40 45 

Thr Leu Arg Gly lie Phe Gly Asp Val Ser Thr Pro Ala Glu Gly lie 
50 55 60 

Pro Lys Glu Leu Thr Glu His Tyr Leu Ser Lys Gly Tyr Ser Arg Asn 



Asp Arg Val Gly Lys Ser Tyr Leu Glu Tyr Gin Tyr Glu Asp Val Leu 
85 90 95 

Arg Gly Lys Lys Lys Glu Met Lys Tyr Thr Thr Asp Lys Ser Gly Lys 
100 105 110 

Val Thr Ser Ser Glu Val Leu Xaa Pro Gly Ala Arg Gly Gin Asp Leu 
115 120 125 

Lys Leu Thr He Asp He Asp Leu Gin Lys Glu Val Glu Ala Leu Leu 
130 135 140 

Asp Lys Gin He Lys Lys Leu Ala Val Lys Val Pro Lys He Trp He 
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 5221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5221: 

He Met Ala Tyr Asp Gly Leu Phe Thr Lys Lys Met Val Glu Ser Leu 
15 10 15 

Gin Phe Leu Thr Thr Gly Arg Val His Lys He Asn Gin Pro Asp Asn 
20 25 30 

Asp Thr He Leu Met Val Val Arg Gin Asn Arg Gin Asn His Gin Leu 
35 40 45 

Leu Leu Ser He His Pro Asn Phe Ser Arg Leu Gin Leu Thr Thr Lys 
50 55 60 

Lys Tyr Asp Asn Pro Phe Asn Pro Pro Met Phe Ala Arg Val Phe Arg 
65 70 75 " 80 
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Asp Axg Arg lie Glu lie Asp lie Lys Ser Lys Asp Glu He Gly Asp 
100 " 105 " 110 

Thr He Tyr Arg Thr Val He Leu Glu He Met Gly Lys His Ser Asn 
115 120 125 

Leu He Leu Val Asp Glu Asn Arg Lys He He Glu Gly Phe Lys His 
130 135 140 

Leu Thr Pro Asn Thr Asn His Tyr Arg Thr Val Met Pro Gly Phe Asn 
145 150 155 160 

Tyr Glu Ala Pro Pro Thr Gin His Lys He Asn Pro Tyr Asp He Thr 
165 170 175 

Gly Ala Glu Val Leu Lys Tyr He Asp Phe Asn Ala Gly Asn He Ala 
180 185 190 

Lys Gin Leu Leu Asn Gin Phe Glu Gly Phe Ser Pro Leu He Thr Asn 
195 200 205 

Glu He Val Ser Arg Arg Gin Phe Met Thr Ser Ser Thr Leu Pro Glu 
210 215 220 

Ala Phe Asp Glu Val Met Ala Glu Thr Lys Leu Pro Pro Thr Pro He 
225 230 235 240 

Phe His Lys Asn His Glu Thr Gly Lys Glu Asp Phe Tyr Phe He Lys 
245 250 255 

Leu Asn Gin Phe Asn Asp Asp Thr Val Thr Tyr Asp Ser Leu Asn Asp 
260 265 270 

Leu Leu Asp Arg Phe Tyr Asp Ala Arg Gly Glu Arg Glu Arg Val Lys 
275 280 285 

Gin Arg Ala Asn Asp Leu Val Arg Phe Val Gin Gin Gin Leu His Lys 
290 295 300 

Tyr Gin Asn Lys Leu Ala Ser 
305 310 

(2) INFORMATION FOR SEQ ID NO: 5222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5222: 
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15 10 15 

Glu Gin Leu Tyr Gly Glu Leu He Thr Ala Asn He Tyr Arg He Lys 
20 ' 25 30 

Gin Gly Asp Lys Glu Val Thr Ala Leu Asn Tyr Tyr Thr Asn Glu Glu 
35 40 45 

Val Val He Pro Leu Asn Pro Thr Lys Ser Pro Ser Ala Asn Ala Gin 
50 55 60 

Tyr Tyr Tyr Lys Gin Tyr Xaa Arg Met Lys Thr Arg Xaa Arg Glu Leu 
65 * 70 75 80 

Gin His Gin He Gin Leu Thr Lys Asp Asn He Asp Tyr Phe Ser Thr 
85 90 95 

He Glu Gin Gin Leu His His He Ser Val His Asp He Asp Glu He 
100 105 no 

Arg Asp Glu Leu Ala Glu Gin Gly Phe Met Lys Gin Arg Lys Asn Gin 
115 120 125 

Thr Lys Lys Lys Lys Ala Gin He Gin Leu Gin His Tyr Val Ser Thr 
130 135 140 

Asp Gly Asp Asp He Tyr Val Gly Lys Asn Asn Lys Gin Asn Asp Tyr 
145 150 155 160 

Leu Thr Asn Lys Lys Ala Lys Lys Thr His Thr Trp Leu His Thr Lys 
165 170 175 

Asp He Pro Gly Ser His Val Val He Phe Asn Asp Ala Pro Ser Asp 
180 185 190 

Thr Thr He Lys Glu Ala Ala Met Leu Ala Gly Tyr Phe Ser Lys Ala 
195 200 205 

Gly Asn Ser Gly Gin He Pro Val Asp Tyr Thr Leu He Lys Asn Val 
210 215 ' 220 

His Lys Pro Ser Gly Ala Lys Pro Gly Phe Val Thr Tyr Asp Asn Gin 
225 230 235 240 

Lys Thr Leu Tyr Ala 
245 

INFORMATION FOR SEQ ID NO: 5223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5223: 

Tyr lie Thr Asn Pro Gin Asn Pro Lys lie Lys lie Thr Gly lie Ser 
1 5 10 15 

Leu Ser Ser Gly Val Gly Asn Phe Phe He He Thr Asn Gly Lys Arg 
20 25 30 

lie He Val Ala Lys lie Lys Arg Asn Ala Asp Asn Asp Ser Ala Leu 
35 40 45 

Lys Ser Phe Asn Ala He Phe He He Gly Asn Ala Asp Pro His Asn 
50 55 60 

Met He Val Asn Lys Tyr Asp Arg Lys Val Val Ser Arg Ser Leu Phe 
65 70 75 80 

He Asn He He Thr Pro Leu He Met Cys Phe Tyr He Lys Lys Tyr 
85 90 95 

Asp Leu Lys 

(2) INFORMATION FOR SEQ ID NO: 5224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5224: 

Glu Asn Val Leu Ala Lys Glu Tyr Ala Val Lys Tyr Asn Ala Val Glu 
15 10 15 

Ala He Gin His Arg Gly Glu Thr Val Thr Glu Gly Ser Ser Ser Asn 
20 25 * 30 

Ala Tyr Ala He Lys Asp Gly Val He Tyr Thr His Pro He Asn Asn 
35 40 45 

Tyr He Leu Asn Gly He Thr Arg He Val He Lys Lys He Ala Glu 
50 55 60 

Asp Tyr Asn He Pro Phe Lys Glu Glu Thr Phe Thr Val Asp Phe Leu 
65 70 75 80 

Lys Asn Ala Asp Glu Val He Val Ser Ser Thr Ser Ala Glu Val Thr 
85 90 95 

Pro Val He Lys Leu Asp Gly Glu Pro Val Asn Asp Gly Lys Val Gly 
100 105 110 
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115 120 125 

His Ser He 



(2) INFORMATION FOR SEQ ID NO: 5225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5225: 

Asn His Leu Thr Ala Arg He He Asn Gin Glu Asp Asp Leu Met Asn 
1 5 10 15 

Leu Phe Arg Gin Gin Lys Phe Ser He Arg Lys Phe Asn Val Gly He 
20 25 30 

Phe Ser Ala Leu He Ala Thr Val Thr Phe He Ser Thr Asn Pro Thr 
35 40 45 

Thr Ala Ser Ala Ala Glu Gin Asn Gin Pro Ala Gin Asn Gin Pro Ala 
50 55 60 

Gin Pro Ala Asp Ala Asn Thr Gin Pro Asn Ala Asn Ala Gly Ala Gin 
65 70 75 * 80 

Ala Asn Pro Thr Ala Gin Pro Ala Ala Pro Ala Asn Gin Gly Gin Pro 
85 90 95 

Ala Val Gin Pro Ala Asn Gin Gly Gly Gin Ala Asn Pro Ala Gly Gly 
100 105 110 

Ala Ala Gin Pro Asn Thr Gin Pro Ala Gly Gin Gly Asp Gin Ala Asp 
115 120 * 125 

Pro Asn Asn Ala Ala Gin Ala Gin Pro Gly Asn Gin Ala Thr Pro Ala 
130 135 140 

Asn Gin Ala Gly Gin Gly Asn Asn Gin Ala Thr Pro Asn Asn Asn Ala 
145 150 155 160 

Thr Pro Ala Asn Gin Thr Gin Pro Ala Asn Ala Pro Ala Ala Ala Gin 
165 170 175 

Pro Ala Ala Pro Val Ala Ala Asn Ala Gin Thr Gin Asp Pro Asn Ala 
180 185 190 

Ser Asn Thr Gly Glu Gly Ser He Asn Thr Thr Leu Thr Phe Asp Asp 
195 200 205 



3221 



EP 0 786 519 A2 



Thr Asp Lys Val Asn Gly Tyr Ser Leu lie Asn Asn Gly Lys He Gly 
225 230 235 240 

Phe Val Asn Ser Glu Leu Arg Arg Ser Asp Met Phe Asp Lys Asn Asn 
245 250 255 

Pro Gin Asn Tyr Gin Ala Lys Gly Asn Val Ala Ala Leu Gly Arg Val 
260 265 270 

Asn Ala Asn Asp Ser Thr Asp His Gly Asn Phe Asn Gly He Ser Lys 
275 280 285 

Thr Val Asn Val Lys Pro Asp Ser Glu Leu He He Asn Phe Thr Thr 
290 295 300 

Met Gin Thr Asn Ser Lys Gin Gly Ala Thr Asn Leu Val He Lys Asp 
305 310 315 320 

Ala Lys Lys Asn Thr Glu Leu Ala Thr Val Asn Val Ala Lys Thr Gly 
325 330 335 

Thr Ala His Leu Phe Lys Val Pro Thr Asp Ala Asp Arg Leu Asp Leu 
340 345 350 

Gin Phe He Pro Asp Asn Thr Ala Val Ala Asp Ala Ser Arg He Thr 
355 360 365 

Thr Asn Lys Asp Gly Tyr Lys Tyr Tyr Ser Phe He Asp Asn Val Gly 
370 375 380 

Leu Phe Ser Gly Ser His Leu Tyr Val Lys Asn Arg Asp Leu Ala Pro 
385 390 395 400 

Lys Ala Thr Asn Asn Lys Glu Tyr Thr He Asn Thr Glu He Gly Asn 
405 410 415 

Asn Gly Asn Phe Gly Ala Ser Leu Lys Ala Asp Gin Phe Lys Tyr Glu 
420 425 430 

Val Thr Leu Pro Gin Gly Val Thr Tyr Val Asn Asn Ser Leu Thr Thr 
435 440 445 

Thr Phe Pro Asn Gly Asn Glu Asp Ser Thr Val Leu Lys Asn Met Thr 
450 455 460 

Val Asn Tyr Asp Gin Asn Ala Asn Lys Val Thr Phe Thr Ser Gin Gly 
465 470 475 480 

Val Thr Thr Ala Arg Gly Thr His Thr Lys Glu Val Leu Phe Pro Asp 
485 490 495 

Lys Ser Leu Lys Leu Ser Tyr Lys Val Asn val Ala Asn He Asp Thr 
500 505 510 

Pro Lys Asn He Asp Phe Asn Glu Lys Leu Thr Tyr Arg Thr Ala Ser 
515 520 525 
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530 535 540 

(2) INFORMATION FOR SEQ ID NO: 5226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 177 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5226: 

Tyr Lys Glu Leu Ser His Gly Arg Leu lie Gly Gly Thr Lys Met His 
15 10 15 

Lys Lys Tyr Phe lie Gly Thr Ser He Leu He Ala Val Phe Val Val 
20 25 30 

He Phe Asp Gin Val Thr Lys Tyr He He Ala Thr Thr Met Lys He 
35 40 45 

Gly Asp Ser Phe Glu Val He Pro His Phe Leu Asn He Thr Ser His 

50 55 60 

Arg Asn Asn Gly Ala Ala Trp Gly He Leu Ser Gly Lys Met Thr Phe 
65 70 75 80 

Phe Phe He He Thr He He He Leu He Ala Leu Val Tyr Phe Phe 
85 90 95 

He Lys Asp Ala Gin Tyr Asn Leu Phe Met Gin Val Ala He Ser Leu 
100 105 110 

Leu Phe Ala Gly Ala Leu Gly Asn Phe He Asp Arg He Leu Thr Gly 
115 120 125 

Glu Val Val Asp Phe He Asp Thr Asn He Phe Gly Tyr Asp Phe Pro 
130 135 140 

He Phe Asn He Ala Asp Ser Ser Leu Thr He Gly Val He Leu He 
145 150 155 160 

He lie Ala Leu Leu Lys Asp Thr Ser Asn Lys Lys Glu Lys Glu Val 
165 170 175 



(2) INFORMATION FOR SEQ ID NO: 5227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5227: 



Ala Gly Lys Ser Ser Leu He Lys Ser Leu He Gly Glu Phe Asn Ala 
15 10 15 

Thr Gly Thr Lys Leu Leu Tyr Asn Lys Pro He Gin Gin Gin Leu Gin 
20 25 30 

His He Thr Tyr He Pro Gin Lys Ala His He Asp Leu Asp Phe Pro 
35 40 45 

He Ser Val Glu Gin Val He Leu Ser Gly Cys Tyr Lys Glu He Gly 
50 55 60 

Trp Phe Arg Arg Pro Asn Lys Ser Ala Arg Asp Lys Leu Lys Gin Leu 
65 70 75 80 

Leu Ser Asp Leu Glu Leu Glu Ser Leu Arg His Arg Gin He Ser Glu 
8 5 90 " 95 

Leu Ser Gly Gly Gin Leu Gin Arg Val Leu Val Ala Arg Ala Leu Met 

100 105 110 

Ser Xaa Ser Glu Val Tyr Phe Leu Asp Glu Pro Phe Val Gly He Asp 
115 120 125 

Phe Ser Ser Glu Lys Leu He Met Thr Lys He Glu Asn Leu Lys Gin 
130 135 140 

Gin Gly Lys Leu He Leu He He His His Asp Leu Ser Lys Ala Lys 
145 150 155 * 160 

Gin Tyr Phe Asp Arg He He Leu Leu Asn Gin Thr Leu Arg Tyr Phe 
165 170 175 

Gly Asp Ser Glu Glu Ala Met Ser Val Thr Arg Leu Asn Glu Thr Phe 
180 185 190 

Met Ser Ser Thr Asp Cys Ser Asp Pro Ser Gin Arg Ser Asn He Thr 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 522 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5228: 

Thr Phe Arg lie He Phe Leu Leu Ser He Arg Lys Arg Ser Asn Arg 
15 10 15 

Thr His Val Ser He His Trp Ser Thr Val Asn Lys Glu Glu He Cys 
20 25 30 

Leu Arg Val Lys Asp Asn Leu Gin Gin He Ser Thr Gin He Asn Asp 
35 40 45 

Lys Ser Glu Lys Asn Asn Phe Ser Thr Lys Pro Asn Val He Ala Val 
50 55 60 

Thr Lys Tyr Val Thr He Glu Arg Ala Lys Glu Ala Tyr Glu Ala Gly 
65 70 75 80 

He Arg His Phe Gly Glu Asn Arg Leu Glu Gly Phe Phe Gin Lys Lys 
85 90 95 

Glu Ala Leu Pro Ser Asp Ala Val He His Phe He Gly Ser Leu Gin 
100 105 110 

Ser Arg Lys Val Lys Asp Val He Asn Asp Val Asp Tyr Phe His Ala 
115 120 125 

Leu Asp Arg Leu Ser Leu Ala Lys Glu He Asn Lys Arg Ala Glu His 
130 135 140 

Lys lie Lys Cys Phe Leu Gin Val Asn Val Ser Gly Glu Ala Ser Lys 
145 150 155 160 

His Gly He Ala Leu Glu Asp Val Asp Gin Phe He Asp Asp Leu Lys 
165 170 175 

Lys Tyr Asp Lys He Glu He Val Gly Leu Met Thr Met Ala Pro Leu 
180 185 190 

Thr Asp Asp Glu Ala Tyr lie Arg Ser Leu Phe Lys Gin Leu Arg Leu 
195 200 205 

Lys Lys Glu Glu He Gin Arg Leu Asn Leu Glu Tyr Ala Pro Cys Asp 
210 215 220 

Glu Leu Ser Met Gly Met Ser Asn Asp Tyr Leu He Ala Val Glu Glu 
225 230 235 240 

Gly Ala Thr Phe Val Arg He Gly Thr Lys Leu Val Gly Glu Glu Glu 
245 250 255 

(2) INFORMATION FOR SEQ ID NO: 5229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5229: 

Lys His Lys Leu Thr He He Thr Gly Gly Phe Phe Thr Met Lys Lys 
15 10 15 

Thr He Met Ala Ser Ser Leu Ala Val Ala Leu Gly Val Thr Gly Tyr 

20 25 30 

Ala Ala Gly Thr Gly His Gin Ala His Ala Ala Glu Val Asn Val Asp 
35 40 45 

Gin Ala His Leu Val Asp Leu Ala His Asn His Gin Asp Gin Leu Asn 
50 55 60 

Ala Ala Pro He Lys Asp Gly Ala Tyr Asp He His Phe Val Lys Asp 
65 70 75 80 

Gly Phe Gin Tyr Asn Phe Thr Ser Asn Gly Thr Thr Trp Ser Trp Ser 
85 90 * 95 

Tyr Glu Ala Ala Asn Gly Gin Thr Ala Gly Phe Ser Asn Val Ala Gly 
100 105 110 

Ala Asp Tyr Thr Thr Ser Tyr Asn Gin Gly Ser Asn Val Gin Ser Val 
115 120 125 

Ser Tyr Asn Ala Gin Ser Ser Asn Ser Asn Val Glu Ala Val Ser Ala 
130 135 140 

Pro Thr Tyr His Asn Tyr Ser Thr Ser Thr Thr Ser Ser Ser Val Arg 
145 150 155 160 

Leu Ser Asn Gly Asn Thr Ala Gly Ala Thr Gly Ser Ser Ala Ala Gin 
165 170 175 

He Met Ala Gin Arg Thr Gly Val Ser Ala Ser Thr Trp Ala Ala He 
180 185 190 

He Ala Arg Glu Ser Asn Gly Gin Val Asn Ala Tyr Asn Pro Ser Gly 
195 200 205 

Ala Ser Gly Leu Phe Gin Thr Met Pro Gly Trp Gly Pro Thr Asn Thr 
210 215 220 

Val Asp Gin Gin He Asn Ala Ala Val Lys Ala Tyr Lys Ala Gin Gly 
225 230 235 240 

Leu Gly Ala Trp Gly Phe 
245 

(2) INFORMATION FOR SEQ ID NO: 523 0: 
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(A) LENGTH: 519 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5230: 

Lys Glu Pro His Lys Met Lys Lys lie Tyr Lys Ser Leu Thr Val Ser 
15 10 15 

Ala He Val Ala Thr Val Ser Leu Ser Ala Leu Pro Gin Ser Leu Ala 
20 25 30 

He Thr His Glu Ser Gin Pro Thr Lys Gin Gin Arg Thr Val Leu Phe 
35 40 45 

Asp Arg Ser His Gly Gin Thr Ala Gly Ala Ala Asp Trp Val Ser Asp 
50 55 60 

Gly Ala Phe Ser Asp Tyr Ala Asp Ser He Gin Lys Gin Gly Tyr Asp 
65 70 75 80 

Val Lys Ala He Asp Gly His Ser Asn He Thr Glu Ala Ser Leu Lys 
85 90 95 

Ser Ser Lys He Phe Val He Pro Glu Ala Asn He Pro Phe Lys Glu 
100 105 110 

Ser Glu Gin Ala Ala He Val Lys Tyr Val Lys Gin Gly Gly Asn Val 
115 120 125 

Val Phe He Ser Asp His Tyr Asn Ala Asp Arg Asn Leu Asn Arg He 
130 135 140 

Asp Ser Ser Glu Ala Met Asn Gly Tyr Arg Arg Gly Ala Tyr Glu Asp 
145 150 155 160 

Met Ser Lys Gly Met Asn Ala Glu Glu Lys Ser Ser Thr Ala Met Gin 
165 170 175 

Gly Val Lys Ser Ser Asp Trp Leu Ser Thr Asn Phe Gly Val Arg Phe 
180 185 190 

Arg Tyr Asn Ala Leu Gly Asp Leu Asn Thr Ser Asn He Val Ser Ser 
195 200 205 

Lys Glu Ser Phe Gly He Thr Glu Gly Val Lys Ser Val Ser Met His 
210 215 220 

Ala Gly Ser Thr Leu Ala He Thr Asn Pro Glu Lys Ala Lys Gly He 
225 230 235 240 

Val Tyr Thr Pro Glu Gin Leu Pro Ala Lys Ser Lys Trp Ser His Ala 
245 250 255 
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Val Asp Gin Gly He Tyr Asn Gly Gly Gly Lys Ala Glu Gly Pro Tyr 
260 265 270 

Val Ala He Ser Lys Val Gly Lys Gly Lys Ala Ala Phe He Gly Asp 
275 280 285 

Ser Ser Leu Val Glu Asp Ser Ser Pro Lys Tyr Val Arg Glu Asp Asn 
290 295 ' 300 

Gly Glu Lys Lys Lys Thr Tyr Asp Gly Phe Lys Glu Gin Asp Asn Gly 
305 * 310 315 320 

Lys Leu Leu Asn Asn He Thr Ala Trp Met Ser Lys Asp Asn Asp Gly 
325 330 335 

Lys Ser Leu Lys Ala Ser Ser Leu Thr Leu Asp Thr Lys Thr Lys Leu 
340 345 350 

Leu Asp Phe Glu Arg Pro Glu Arg Ser Thr Glu Pro Glu Lys Glu Pro 

355 360 365 

Trp Ser Gin Pro Pro Ser Gly Tyr Lys Trp Tyr Asp Pro Thr Thr Phe 
370 375 380 

Lys Ala Gly Ser Tyr Gly Ser Glu Lys Gly Ala Asp Pro Gin Pro Asn 
385 390 395 400 

Thr Pro Asp Asp His Thr Pro Pro Asn Gin Asn Glu Lys Val Thr Phe 
405 410 415 

Asp He Pro Gin Asn Val Ser Val Asn Glu Pro Phe Glu Met Thr He 
420 425 430 

His Leu Lys Gly Phe Glu Ala Asn Gin Thr Leu Glu Asn Leu Arg Val 
435 * 440 445 

Gly He Tyr Lys Glu Gly Gly Arg Gin lie Gly Gin Phe Ser Ser Lys 
450 * 455 460 

Asp Asn Asp Tyr Asn Pro Pro Gly Tyr Ser Thr Leu Pro Thr Val Lys 
465 470 475 480 

Ala Asp Glu Asn Gly Asn Val Thr He Lys Val Asn Ala Lys Val Leu 
485 490 495 

Glu Ser Met Glu Gly Ser Lys He Arg Leu Lys Leu Gly Asp Lys Thr 
500 505 510 

Leu He Thr Thr Asp Phe Lys 
515 

(2) INFORMATION FOR SEQ ID NO: 5231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



3228 



EP 0 786 519 A2 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5231: 

Arg Asp Glu Arg lie Lys Thr Met Thr Asn Ser Ser Lys Ser Phe Thr 
1 5 10 15 

Lys Phe Met Ala Ala Ser Ala Val Phe Thr Met Gly Phe Leu Ser Val 
2 0 2 5 '30 

Pro Thr Ala Gly Ala Glu Gin Thr Asn Gin lie Ala Asn Lys Pro Gin 
35 40 45 

Ala He Gin Trp His Thr Asn Leu Thr Asn Glu Arg Phe Thr Thr He 
50 55 60 

Ala His Arg Gly Ala Ser Gly Tyr Ala Pro Glu His Thr Phe Gin Ala 
65 70 75 80 

Tyr Asp Lys Ser His Asn Glu Leu Lys Ala Ser Tyr He Glu He Asp 
85 90 95 

Leu Gin Arg Thr Lys Asp Gly His Leu Val Ala Met His Asp Glu Thr 
100 105 110 

Val Asn Arg Thr Thr Asn Gly His Gly Lys Val Glu Asp Tyr Thr Leu 
115 120 125 

Asp Glu Leu Lys Gin Leu Asp Ala Gly Ser Trp Phe Asn Lys Lys Tyr 
130 135 140 

Pro Lys Tyr Ala Arg Ala Ser Tyr Lys Asn Ala Lys Val Pro Thr Leu 
145 150 155 160 

Asp Glu He Leu Glu Arg Tyr Gly Pro Asn Ala Asn Tyr Tyr He Glu 
165 170 175 

Thr Lys Ser Pro Asp Val Tyr Pro Gly Met Glu Glu Gin Leu Leu Ala 
180 185 190 

Ser Leu Lys Lys His His Leu Leu Asn Asn Asn Lys Leu Lys Asn Gly 
195 200 205 

His Val Met He Gin Ser Phe Ser Asp Glu Ser Leu Lys Lys He His 
210 215 220 

Arg Gin Asn Lys His Val Pro Leu Val Lys Leu Val Asp Lys Gly Glu 
225 230 235 240 

Leu Gin Gin Phe Asn Asp Gin Arg Leu Lys Glu He Arg Ser Tyr Ala 
245 250 255 

He Gly Leu Gly Pro Asp Tyr Thr Asp Leu Thr Glu Gin Asn Thr His 
260 265 270 

His Leu Lys Asp Leu Gly Phe He Val His Pro Tyr Thr Val Asn Glu 
275 280 285 
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Lys Ala Asp Met Leu Arg Leu Asn Lys Tyr Gly Val Asp Gly Val Phe 
290 295 300 

Thr Asn Phe Ala Asp Lys Tyr Lys Glu Val lie Lys 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 5232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5232: 

Arg Phe Met Lys Asn Leu He Ser He He He He Leu Cys Leu Thr 
15 10 15 

Leu Ser He Met Thr Pro Tyr Ala Gin Ala Thr Asn Ser Asp Val Thr 
20 * 25 30 

Pro Val Gin Ala Ala Asn Gin Tyr Gly Tyr Ala Gly Leu Ser Ala Ala 
35 40 45 

Tyr Glu Pro Thr Ser Ala Val Asn Val Ser Gin Thr Gly Gin Leu Leu 
50 55 60 

Tyr Gin Tyr Asn He Asp Thr Lys Trp Asn Pro Ala Ser Met Thr Lys 
65 ' 70 75 80 

Leu Met Thr Met Tyr Leu Thr Leu Glu Ala Val Asn Lys Gly Gin Leu 
85 90 95 

Ser Leu Asp Asp Thr Val Thr Met Thr Asn Lys Glu Tyr lie Met Ser 
100 105 110 

Thr Leu Pro Glu Leu Ser Asn Thr Lys Leu Tyr Pro Gly Gin Val Trp 
115 120 125 

Thr He Ala Asp Leu Leu Gin He Thr Val Ser Asn Ser Ser Asn Ala 
130 * 135 140 

Ala Ala Leu He Leu Ala Lys Lys Val Ser Lys Asn Thr Ser Asp Phe 
145 150 155 160 

Val Asp Leu Met Asn Asn Lys Ala Lys Ala He Gly Met Lys Asn Thr 
165 170 175 

His Phe Val Asn Pro Thr Gly Ala Glu Asn Ser Arg Leu Arg Thr Phe 
180 185 190 

Ala Pro Thr Lys Tyr Lys Asp Gin Glu Arg Thr Val Thr Thr Ala Arg 
195 200 205 
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Asp Tyr Ala lie Leu Asp Leu His Val lie Lys Glu Thr Pro Lys lie 
210 215 220 

Leu Asp Phe Thr Lys Gin Leu Ala Pro Thr Thr His Ala Val Thr Tyr 
225 230 235 240 

Tyr Thr Phe Asn Phe Ser Leu Glu Gly Ala Lys Met Ser Leu Pro Gly 
245 250 255 

Thr Asp Gly Leu Lys Thr Gly Ser Ser Asp Thr Ala Asn Tyr Asn His 
260 265 270 

Thr He Thr Thr Lys Arg Gly Lys Phe Arg He Asn Gin Val He Met 
275 280 285 

Gly Ala Gly Asp Tyr Lys Asn Leu Gly Gly Glu Lys Gin Arg Asn Met 
290 295 300 

Met Gly Asn Ala Leu Met Glu Arg Ser Phe Asp Gin Tyr Lys Tyr Val 
305 310 315 320 

Lys He Leu Ser Lys Gly Glu Gin Arg He Asn Gly Lys Lys Tyr Tyr 
325 330 335 

Val Glu Asn Asp Leu Tyr Asp Val Leu Pro Ser Asp Phe Ser Lys Lys 
340 345 * 350 

Asp Tyr Lys Leu Val Val Glu Asp Gly Lys Val His Ala Asp Tyr Pro 
355 360 365 

Arg Glu Phe He Asn Lys Asp Tyr Gly Pro Pro Thr Val Glu Val His 
370 375 380 

Gin Pro He He Gin Lys Ala Asn Thr Val Ala Lys Ser Met Trp Glu 
385 390 395 400 

Glu His Pro Leu Phe Thr He He Gly Gly Thr Cys Leu Val Ala Gly 
405 410 415 

Leu Ala Leu He Val His Met He He Asn Arg Leu Phe Arg Lys Arg 
420 425 430 



(2) INFORMATION FOR SEQ ID NO.-5233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:5233: 
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Leu Thr Lys Glu Arg Glu Tyr Met Lys Leu Lys Ser Phe He Thr Val 
15 10 15 

Thr Leu Ala Leu Gly Met He Ala Thr Thr Gly Ala Thr Val Ala Gly 
20 * 25 30 

Asn Glu Val Ser Ala Ala Glu Lys Asp Lys Leu Pro Ala Thr Gin Lys 
35 40 45 

Ala Lys Glu Met Gin Asn Val Pro Tyr Thr He Ala Val Asp Gly He 
50 55 60 

Met Ala Phe Asn Gin Ser Tyr Leu Asn Leu Pro Lys Asp Ser Gin Leu 
65 70 75 80 

Ser Tyr Leu Asp Leu Gly Asn Lys Val Lys Ala Leu Leu Tyr Asp Glu 
85 90 95 

Arg Gly Val Thr Pro Glu Lys He Arg Asn Ala Lys Ser Ala Val Tyr 
100 105 110 

Thr He Thr Trp Lys Asp Gly Ser Lys Lys Glu Val Asp Leu Lys Lys 
115 120 125 

Asp Ser Tyr Thr Ala Asn Leu Phe Asp Ser Asn Ser He Lys Gin He 
130 135 140 

Asp He Asn Val Lys Thr Lys 
145 150 

(2) INFORMATION FOR SEQ ID NO: 5234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5234: 

Asn His Cys Asn Arg He Glu Arg Lys Met Ala Met Ser Asn Asn Phe 
15 10 15 

Lys Asp Asp Phe Glu Lys Asn Arg Gin Ser He Asp Thr Asn Ser His 
20 25 30 

Gin Asp His Thr Glu Asp Val Glu Lys Asp Gin Ser Glu Leu Glu His 
35 40 45 

Gin Asp Thr He Glu Asn Thr Glu Gin Gin Phe Pro Pro Arg Asn Ala 
50 55 60 

Gin Arg Arg Lys Arg Arg Arg Asp Leu Ala Thr Asn His Asn Lys Gin 



3232 



EP0 786 519 A2 

Val His Asn Glu Ser Gin Thr Ser Glu Asp Asn Val Gin Asn Glu Ala 
85 90 95 

Gly Thr He Asp Asp Arg Gin Val Glu Ser Ser His Ser Thr Glu Ser 
100 105 110 

Gin Glu Pro Ser His Gin Asp Ser Thr Pro Gin His Glu Glu Glu Tyr 
115 120 125 

Tyr Asn Lys Asn Ala Phe Ala Met Asp Lys Ser His Pro Glu Pro He 
130 135 140 

Glu Asp Asn Asp Lys His Asp Thr He Lys Asn Ala Glu Asn Asn Thr 
145 150 155 160 

Glu His Ser Thr Val Ser Asp Lys Ser Glu Ala Glu Gin Ser Gin Gin 
165 170 175 

Pro Lys Pro Tyr Phe Thr Thr Gly Ala Asn Gin Ser Glu Thr Ser Lys 
180 185 190 

Asn Glu His Asp Asn Asp Ser Val Lys Gin Asp Gin Asp Glu Pro Lys 
195 * 200 205 

Glu His His Asn Gly Lys Lys Ala Ala Ala He Gly Ala Gly Thr Ala 
210 215 220 

Gly Val Ala Gly Ala Ala Gly Ala Met Ala Ala Ser Lys Ala Lys Lys 
225 230 235 240 

His Ser Asn Asp Ala Gin Asn Lys Ser Asn Ser Gly Lys Ala Asn Asn 
245 250 255 

Ser Thr Glu Asp Lys Ala Ser Gin Asp Lys Ser Lys Asp His His Asn 
260 265 270 

Gly Lys Lys Gly Ala Ala He Gly Ala Gly Thr Ala Gly Leu Ala Gly 
275 280 285 

Gly Ala Ala Ser Lys Ser Ala Ser Ala Ala Ser Lys Pro His Ala Ser 
290 295 300 

Asn Asn Ala Ser Gin Asn His Asp Glu His Asp Asn His Asp Arg Asp 
305 310 315 320 

Lys Glu Arg Lys Lys Gly Gly Met Ala Lys Val Leu Leu Pro Leu He 
325 330 335 

Ala Ala Val Leu He He Gly Ala Leu Ala He Phe Gly Gly Met Ala 
340 345 350 

Leu Asn Asn His Asn Asn Gly Thr Lys Glu Asn Lys He Ala Asn Thr 
355 360 365 

Asn Lys Asn Asn Ala Asp Glu Ser Lys Asp Lys Asp Thr Ser Lys Asp 
370 375 380 

Ala Ser Lys Asp Lys Ser Lys Ser Thr Asp Ser Asp Lys Ser Lys Glu 
385 390 395 400 
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Asp Gin Asp Lys Ala Thr Lys Asp Glu Ser Asp Asn Asp Gin Asn Asn 
405 410 415 

Ala Asn Gin Ala Asn Asn Gin Ala Gin Asn Asn Gin Asn Gin Gin Gin 
420 425 430 

Ala Asn Gin Asn Gin Gin Gin Gin Gin Gin Arg Gin Gly Gly Gly Gin 
435 440 445 

Arg His Thr Val Asn Gly Gin Glu Asn Leu Tyr Arg lie Ala lie Gin 
450 455 460 

Tyr Tyr Gly Ser Gly Ser Pro Glu Asn Val Glu Lys lie Arg Arg Ala 
465 470 475 480 

Asn Gly Leu Ser Gly Asn Asn He Arg Asn Gly Gin Gin He Val He 
485 490 495 



INFORMATION FOR SEQ ID NO: 5235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 886 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Eii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ II 

Leu Leu Ser He Lys Tyr Asn Leu 
1 5 

Lys His His Pro Lys Leu Arg Ser 
20 

Leu Gly Val Ala Ser Val He Val 
35 40 

Gin His Gin Ala Gin Ala Ala Glu 
50 55 

Ser Glu Asn Gin Asn Asn Asn Ala 
65 70 

Thr Asn Gin Thr Gin Pro Ala Thr 
85 

Tyr Pro Ala Ala Asp Glu Ser Leu 
100 

Leu Glu Asn Lys Glu His Asp He 
115 120 



> NO -.523 5: 

He Gly Val Val Asn Asn Met Asn 
10 15 

Phe Tyr Ser He Arg Lys Ser Thr 
25 30 

Ser Thr Leu Phe Leu He Thr Ser 
45 

Asn Thr Asn Thr Ser Asp Lys He 
60 

Thr Thr Thr Gin Pro Pro Lys Asp 
75 80 

Gin Pro Ala Asn Thr Ala Lys Asn 

90 95 

Lys Asp Ala He Lys Asp Pro Ala 
105 110 

Gly Pro Arg Glu Gin Val Asn Phe 
125 
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Gin Leu Leu Asp Lys Asn Asn Glu Thr Gin Tyr Tyr His Phe Phe Ser 
130 ' 135 140 

He Lys Asp Pro Ala Asp Val Tyr Tyr Thr Lys Lys Lys Ala Glu Val 
145 150 155 160 

Glu Leu Asp He Asn Thr Ala Ser Thr Trp Lys Lys Phe Glu Val Tyr 
165 170 175 

Glu Asn Asn Gin Lys Leu Pro Val Arg Leu Val Ser Tyr Ser Pro Val 
180 185 190 

Pro Glu Asp His Ala Tyr He Arg Phe Pro Val Ser Asp Gly Thr Gin 
195 200 205 

Glu Leu Lys He Val Ser Ser Thr Gin He Asp Asp Gly Glu Glu Thr 
210 215 220 

Asn Tyr Asp Tyr Thr Lys Leu Val Phe Ala Lys Pro He Tyr Asn Asp 
225 230 235 240 

Pro Ser Leu Val Lys Ser Asp Thr Asn Asp Ala Val Val Thr Asn Asp 
245 250 255 

Gin Ser Ser Ser Val Ala Ser Asn Gin Thr Asn Thr Asn Thr Ser Asn 
260 265 270 

Gin Asn He Ser Thr He Asn Asn Ala Asn Asn Gin Pro Gin Ala Thr 
275 280 285 

Thr Asn Met Ser Gin Pro Ala Gin Pro Lys Ser Ser Thr Asn Ala Asp 
290 295 300 

Gin Ala Ser Ser Gin Pro Ala His Glu Thr Asn Ser Asn Gly Asn Thr 
305 310 315 320 

Asn Asp Lys Thr Asn Glu Ser Ser Asn Gin Ser Asp Val Asn Gin Gin 
325 330 335 

Tyr Pro Pro Ala Asp Glu Ser Leu Gin Asp Ala He Lys Asn Pro Ala 
340 345 350 

He He Asp Lys Glu His Thr Ala Asp Asn Trp Arg Pro He Asp Phe 
355 360 365 

Gin Met Lys Asn Asp Lys Gly Glu Arg Gin Phe Tyr His Tyr Ala Ser 
370 375 380 

Thr Val Glu Pro Ala Thr Val He Phe Thr Lys Thr Gly Pro He He 
385 390 395 400 

Glu Leu Gly Leu Lys Thr Ala Ser Thr Trp Lys Lys Phe Glu Val Tyr 
405 410 415 

Glu Gly Asp Lys Lys Leu Pro Val Glu Leu Val Ser Tyr Asp Ser Asp 
420 425 430 

Lys Asp Tyr Ala Tyr He Arg Phe Pro Val Ser Asn Gly Thr Arg Glu 
435 440 445 
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Val Lys He Val Ser Ser lie Glu Tyr Gly Glu Asn He His Glu Asp 
450 455 460 

Tyr Asp Tyr Thr Leu Met Val Phe Ala Gin Pro He Thr Asn Asn Pro 
465 470 475 480 

Asp Asp Tyr Val Asp Glu Glu Thr Tyr Asn Leu Gin Lys Leu Leu Ala 
485 490 495 

Pro Tyr His Lys Ala Lys Thr Leu Glu Arg Gin Val Tyr Glu Leu Glu 
500 505 510 

Lys Leu Gin Glu Lys Leu Pro Glu Lys Tyr Lys Ala Glu Tyr Lys Lys 

515 520 525 

Lys Leu Asp Gin Thr Arg Val Glu Leu Ala Asp Gin Val Lys Ser Ala 
530 535 540 

Val Thr Glu Phe Glu Asn Val Thr Pro Thr Asn Asp Gin Leu Thr Asp 
545 550 555 560 

Leu Gin Glu Ala His Phe Val Val Phe Glu Ser Glu Glu Asn Ser Glu 
565 570 575 

Ser Val Met Asp Gly Phe Val Glu His Pro Phe Tyr Thr Ala Thr Leu 
580 585 590 

Asn Gly Gin Lys Tyr Val Val Met Lys Thr Lys Asp Asp Ser Tyr Trp 
595 600 605 

Lys Asp Leu He Val Glu Gly Lys Arg Val Thr Thr Val Ser Lys Asp 
610 615 620 

Pro Lys Asn Asn Ser Arg Thr Leu He Phe Pro Tyr He Pro Asp Lys 
625 630 635 640 

Ala Val Tyr Asn Ala He Val Lys Val Val Val Ala Asn He Gly Tyr 
645 650 655 

Glu Gly Gin Tyr His Val Arg He He Asn Gin Asp He Asn Thr Lys 
660 665 670 

Asp Asp Asp Thr Ser Gin Asn Asn Thr Ser Glu Pro Leu Asn Val Gin 
675 680 685 

Thr Gly Gin Glu Gly Lys Val Ala Asp Thr Asp Val Ala Glu Asn Ser 
690 695 700 

Ser Thr Ala Thr Asn Pro Lys Asp Ala Ser Asp Lys Ala Asp Val He 
705 710 715 720 

Glu Pro Glu Ser Asp Val Val Lys Asp Ala Asp Asn Asn He Asp Lys 
725 730 735 

Asp Val Gin His Asp Val Asp His Leu Ser Asp Met Ser Asp Asn Asn 
740 745 750 

His Phe Asp Lys Tyr Asp Leu Lys Glu Met Asp Thr Gin He Ala Lys 
755 760 765 
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Asp Thr Asp Arg Asn Val Asp Lys Asp Ala Asp Asn Ser Val Gly Met 
770 775 780 

Ser Ser Asn Val Asp Thr Asp Lys Asp Ser Asn Lys Asn Lys Asp Lys 
S 785 790 795 800 

Val He Gin Leu Asn His He Ala Asp Lys Asn Asn His Thr Gly Lys 
805 810 815 

w Ala Ala Lys Leu Asp Val Val Lys Gin Asn Tyr Asn Asn Thr Asp Lys 

820 825 830 

Val Thr Asp Lys Lys Thr Thr Glu His Leu Pro Ser Asp He His Lys 
835 840 845 

15 Thr Val Asp Lys Thr Val Lys Thr Lys Glu Lys Ala Gly Thr Pro Ser 

850 855 860 

Lys Glu Asn Lys Leu Ser Gin Ser Lys Met Leu Thr Lys Asn Trp Arg 
865 870 875 880 

20 

Asn Asn Xaa Gin Ala Asn 

885 

(2) INFORMATION FOR SEQ ID NO: 5236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5236: 

Asn Met Asn Lys Asn Val Met Val Lys Gly Leu Thr Ala Leu Thr He 

15 10 15 

Leu Thr Ser Leu Gly Phe Ala Glu Asn He Ser Asn Gin Xaa His Ser 
20 25 30 

He Ala Lys Ala Glu Lys Asn Val Lys Glu He Thr Asp Ala Thr Lys 
35 40 45 

Glu Pro Tyr Asn Ser Val Val Ala Phe Val Gly Gly Thr Gly Val Val 
50 55 60 

Val Gly Lys Asn Thr He Val Thr Asn Lys His He Ala Lys Ser Asn 
65 70 75 80 

Asp He Phe Lys Asn Arg Val Ser Ala His His Ser Ser Lys Gly Lys 

85 90 95 

Gly Gly Gly Asn Tyr Asp Val Lys Asp He Val Glu Tyr Pro Gly Lys 
100 105 110 
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Glu Asp Leu Ala lie Val His Val His Glu Thr Ser Thr Glu Gly Leu 
115 120 125 

Asn Phe Asn Lys Asn Val Ser Tyr Thr Lys Phe Ala Asp Gly Ala Lys 
130 135 140 

Val Lys Asp Arg lie Ser Val lie Gly Tyr Pro Lys Gly Ala Gin Thr 
145 150 155 * 160 

Lys Tyr Lys Met Phe Glu Ser Thr Gly Thr He Asn His He Ser Gly 
165 170 175 

Thr Phe Met Glu Phe Asp Ala Tyr Ala Gin Pro Gly Asn Ser Gly Ser 
180 185 190 

Pro Val Leu Asn Ser Lys His Xaa Leu He Gly He Leu Tyr Ala Gly 
195 200 205 

Ser Gly Lys Asp Glu Ser Glu Lys Asn Phe Gly Val Tyr Phe Thr Pro 
210 215 220 

Gin Leu Xaa Xaa Phe He Pro Asn Asn He Glu Lys 
225 230 235 

INFORMATION FOR SEQ ID NO: 52 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5237: 

Tyr Arg Leu Glu His Thr He Met Lys Met Arg Thr He Ala Lys Thr 
l 5 10 15 

Ser Leu Ala Leu Gly Leu Leu Thr Thr Gly Ala He Thr Val Thr Thr 
20 25 30 

Gin Ser Val Lys Ala Glu Lys He Gin Ser Thr Lys Val Asp Lys Val 
35 40 45 

Pro Thr Leu Lys Ala Glu Arg Leu Ala Met He Asn He Thr Ala Gly 
50 ' 55 60 

Ala Asn Ser Ala Thr Thr Gin Ala Ala Asn Thr Arg Gin Glu Arg Thr 
65 70 75 80 

Pro Lys Leu Glu Lys Ala Pro Asn Thr Asn Glu Glu Lys Thr Ser Ala 
85 90 95 

Ser Lys He Glu Lys He Ser Gin Pro Lys Gin Glu Glu Gin Lys Thr 
100 105 110 
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Leu Asn lie Ser Ala Thr Pro Ala Pro Lys Gin Glu Gin Ser Gin Thr 
115 120 125 

Thr Thr Glu Ser Thr Thr Pro Lys Thr Lys Val Thr Thr Pro Pro Ser 
130 135 140 

Thr Asn Thr Pro Gin Pro Met Gin Ser Thr Lys Ser Asp Thr Pro Gin 
145 150 155 " 160 

Ser Pro Thr He Lys Gin Ala Gin Thr Asp Met Thr Pro Lys Tyr Glu 
165 170 175 

Asp Leu Arg Ala Tyr Tyr Thr Lys Pro Ser Phe Glu Phe Glu Lys Gin 
180 185 190 

Phe Gly Phe Met Leu Lys Pro Trp Thr Thr Val Arg Phe Met Asn Val 
195 200 205 

He Pro Asn Arg Phe He Tyr Lys He Ala Leu Val Gly Lys Asp Glu 
210 215 220 

Lys Lys Tyr Lys Asp Gly Pro Tyr Asp Asn lie Asp Val Phe He Val 
225 230 235 240 

Leu Glu Asp Asn Lys Tyr Gin Leu Lys Lys Tyr Ser Val Gly Gly He 
245 250 255 

Thr Lys Thr Asn Ser Lys Lys Val Asn His Lys Val Glu Leu Ser He 
260 265 270 

Thr Lys Lys Asp Asn Gin Gly Met He Ser Arg Asp Val Ser Glu Tyr 
275 280 285 

Met He Thr Lys Glu Glu He Ser Leu Lys Glu Leu Asp Phe Lys Leu 
290 295 300 

Arg Lys Gin Leu He Glu Lys His Asn Leu Tyr Gly Asn Met Gly Ser 
305 310 315 320 

Gly Thr He Val He Lys Met Lys Asn Gly Gly Lys Tyr Thr Phe Glu 
325 330 335 

Leu His Lys Lys Leu Gin Glu His Arg Met Ala Asp Val He Asp Gly 
340 345 ~ 350 

Thr Asn He Asp Asn He Glu Val Asn He Lys 
355 360 

(2) INFORMATION FOR SEQ ID NO: 5238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5238: 

Phe Met Lys Phe Lys Ser Leu lie Thr Thr Thr Leu Ala Leu Gly Val 
15 10 15 

Leu Ala Ser Thr Gly Ala Asn Phe Asn Asn Asn Glu Ala Ser Ala Ala 
20 25 30 

Ala Lys Pro Leu Asp Lys Ser Ser Ser Ser Leu His His Gly Tyr Ser 
35 * 40 45 

Lys Val His Val Pro Tyr Ala He Thr Val Asn Gly Thr Ser Gin Asn 
50 55 GO 

He Leu Ser Ser Leu Thr Phe Asn Lys Asn Gin Asn He Ser Tyr Lys 
65 70 75 80 

Asp Leu Glu Asp Arg Val Lys Ser Val Leu Lys Ser Asp Arg Gly He 
85 90 95 

Ser Asp He Asp Leu Arg Leu Ser Lys Gin Ala Lys Tyr Thr Val Tyr 
100 105 110 

Phe Lys Asn Gly Thr Lys Lys Val lie Asp Leu Lys Ala Gly He Tyr 
115 120 125 

Thr Ala Asp Leu He Asn Thr Ser Glu He Lys Ala He Asn He Asn 
130 135 140 

Val Asp Thr Lys Lys Gin 
145 150 

(2) INFORMATION FOR SEQ ID NO: 5239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523 9: 

Glu Lys Arg Phe Met Gin Met Ala Arg Lys Val Val Val Val Asp Asp 
15 10 15 

Glu Lys Pro He Ala Asp He Leu Glu Phe Asn Leu Lys Lys Glu Gly 

20 25 30 

Tyr Asp Val Tyr Cys Ala Tyr Asp Gly Asn Asp Ala Val Asp Leu He 
35 40 45 

Tyr Glu Glu Glu Pro Asp He Val Leu Leu Asp He Met Leu Pro Gly 
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Arg Asp Gly Met Glu Val Cys Arg Glu Val Arg Lys Lys Tyr Glu Met 
65 70 75 80 

Pro lie lie Met Leu Thr Ala Lys Asp Ser Glu lie Asp Lys Val Leu 
85 90 95 

Gly Leu Glu Leu Gly Ala Asp Asp Tyr Val Thr Lys Pro Phe Ser Thr 
100 105 110 

Arg Glu Leu lie Ala Arg Val Lys Ala Asn Leu Arg Arg His Tyr Ser 
115 120 125 

Gin Pro Ala Gin Asp Thr Gly Asn Val Thr Asn Glu lie Thr lie Lys 
130 135 140 

Asp He Val He Tyr Pro Asp Ala Tyr Ser He Lys Lys Arg Gly Glu 
145 150 155 160 

Asp He Glu Leu Thr His Arg Glu Phe Glu Leu Phe His Tyr Leu Ser 
165 170 175 

Lys His Met Gly Gin Val Met Thr Arg Glu His Leu Leu Gin Thr Val 
180 185 190 

Trp Gly Tyr Asp Tyr Phe Gly Asp Val Arg Thr Val Asp Val Thr He 
195 200 205 

Arg Arg Leu Arg Glu Lys He Glu Asp Asp Pro Ser His Pro Glu Tyr 
210 215 220 

He Val Thr Arg Arg Gly Val Gly Tyr Phe Leu Gin Gin His Glu 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 5240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5240: 

Xaa Leu Ser Thr Val lie Gly Ala Xaa Leu Phe Phe Lys Ser Ser Val 
15 10 15 

Ser Leu Val Phe Lys Met Val Lys Lys Phe Arg Xaa Gly Val He Ser 
20 25 30 

Val Asn Asp Val Met Phe Ser Ser Ser He Met Tyr Arg He Lys Lys 
35 40 45 

Asn Ala Phe Ser Leu Thr Val Met Ala He He Ser Ala He Thr Val 
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Ser Val Leu Cys Phe Ala Ala He Ser Arg Ala Ser Leu Ser Ser Glu 
65 70 75 80 

He Lys Tyr Thr Ala Pro His Asp Val Thr lie Lys Asp Gin Gin Lys 
85 90 95 

Ala Asn Gin Leu Ala Ser Glu Leu Asn Asn Gin Lys He Pro His Phe 
100 105 110 

Tyr Asn Tyr Lys Glu Val He His Thr Lys Leu Tyr Lys Asp Asn Leu 



Phe Asp Val Lys Ala 
130 



(2) INFORMATION FOR SEQ ID NO: 5241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5241: 

Glu He Tyr He He Ala Asn Lys Gin Arg Arg Asp Asn Met Ala Val 
15 10 15 

Asn Val Arg Asp Tyr He Ala Glu Asn Tyr Gly Leu Phe He Asn Gly 
20 25 30 

Glu Phe Val Lys Gly Ser Ser Asp Glu Thr He Glu val Thr Asn Pro 
35 40 45 

Ala Thr Gly Glu Thr Leu Ser His He Thr Arg Ala Lys Asp Lys Asp 
50 55 60 

Val Asp His Ala Val Lys Val Ala Gin Glu Ala Phe Glu Ser Trp Ser 



Leu Thr Ser Lys Ser Glu Arg Ala Gin Met Leu Arg Asp He Gly Asp 
85 90 95 

Lys Leu Met Ala Gin Lys Asp Lys He Ala Met He Glu Thr Leu Asn 
100 105 110 

Asn Gly Lys Pro He Arg Glu Thr Thr Ala He Asp lie Pro Phe Ala 
115 120 125 

Ala Arg His Phe His Tyr Phe Ala Ser Val He Glu Thr Glu Glu Gly 
130 135 140 

Thr Val Asn Asp He Asp Lys Asp Thr Met Ser He Val Arg His Glu 
145 150 155 160 
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Pro He Gly Val Val Gly Ala Val Val Ala Trp Asn Phe Pro Met Leu 

165 170 175 

Leu Ala Ala Trp Lys He Ala Pro Ala He Ala Ala Gly Asn Thr He 
180 185 190 

Val He Gin Pro Ser Ser Ser Thr Pro Leu Ser Leu Leu Glu Val Ala 
195 200 205 

Lys He Phe Gin Glu Val Leu Pro Lys Gly Val Val Asn He Leu Thr 
210 215 220 

Gly Lys Gly Ser Glu Ser Gly Asn Ala He Phe Asn His Asp Gly Val 
225 230 235 " 240 

Asp Lys Leu Ser Phe Thr Gly Ser Thr Asp Val Gly Tyr Gin Val Ala 
245 250 255 

Glu Ala Ala Ala Lys His Leu Val Pro Ala Thr Leu Glu Leu Gly Gly 
260 265 270 

Lys Ser Ala Asn He He Leu Asp Asp Ala Asn Leu Asp Leu Ala Val 
275 280 285 

Glu Gly He Gin Leu Gly He Leu Phe Asn Gin Gly Glu Val Cys Ser 
290 295 300 

Ala Gly Ser Arg Leu Leu Val His Glu Lys He Tyr Asp Gin Leu Val 
305 310 315 " 320 

Pro Arg Leu Gin Glu Ala Phe Ser Asn He Lys Val Gly Asn Pro Gin 
325 330 335 

Asp Glu Ala Thr Gin Met Gly Ser Gin Thr Gly Lys Asp Gin Leu Asp 
340 345 350 

Lys He Gin Ser Tyr He Asp Ala Ala Lys Glu Ser Asp Ala Gin He 

355 360 365 

Leu Ala Gly Gly His Arg Leu Thr Glu Asn Gly Leu Asp Lys Gly Phe 
370 375 380 

Phe Phe Glu Pro Thr Leu He Ala Val Pro Asp Asn His His Lys Leu 
385 390 395 400 

Ala Gin Glu Glu He Phe Gly Pro Val Leu Thr Val He Lys Val Lys 
405 410 415 

Asp Asp Gin Glu Ala lie Asp He Ala Asn Asp Ser Glu Tyr Gly Leu 
420 425 430 

Ala Gly Gly Val Phe Ser Gin Asn He Thr Arg Ala Leu Asn He Ala 
435 440 445 

Lys Ala Val Arg Thr Gly Arg He Trp He Asn Thr Tyr Asn Gin Val 
450 455 460 

Pro Glu Gly Ala Pro Phe Gly Gly Tyr Lys Lys Ser Gly He Gly Arg 
465 470 475 480 
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Glu Thr Tyr Lys Gly Ala Leu Ser Asn Tyr Gin Gin Val Lys Asn lie 
485 490 495 

Tyr lie Asp Thr Ser Asn Ala Leu Lys Gly Leu Tyr 

5 ' 500 505 

(2) INFORMATION FOR SEQ ID NO: 5242: 

(i) SEQUENCE CHARACTERISTICS: 
w (A) LENGTH: 54 0 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5242: 

Asn His Leu Thr Ala Arg lie lie Asn Gin Glu Asp Asp Leu Met Asn 
15 10 15 

Leu Phe Arg Gin Gin Lys Phe Ser lie Arg Lys Phe Asn Val Gly lie 
20 25 30 

Phe Ser Ala Leu lie Ala Thr Val Thr Phe He Ser Thr Asn Pro Thr 
35 40 45 

Thr Ala Ser Ala Ala Glu Gin Asn Gin Pro Ala Gin Asn Gin Pro Ala 
50 55 60 

Gin Pro Ala Asp Ala Asn Thr Gin Pro Asn Ala Asn Ala Gly Ala Gin 
65 70 75 60 

Ala Asn Pro Thr Ala Gin Pro Ala Ala Pro Ala Asn Gin Gly Gin Pro 
85 90 95 

Ala Val Gin Pro Ala Asn Gin Gly Gly Gin Ala Asn Pro Ala Gly Gly 
100 105 110 

Ala Ala Gin Pro Asn Thr Gin Pro Ala Gly Gin Gly Asp Gin Ala Asp 
115 120 125 

Pro Asn Asn Ala Ala Gin Ala Gin Pro Gly Asn Gin Ala Thr Pro Ala 
130 135 140 

Asn Gin Ala Gly Gin Gly Asn Asn Gin Ala Thr Pro Asn Asn Asn Ala 
145 150 155 160 

Thr Pro Ala Asn Gin Thr Gin Pro Ala Asn Ala Pro Ala Ala Ala Gin 
165 170 175 

Pro Ala Ala Pro Val Ala Ala Asn Ala Gin Thr Gin Asp Pro Asn Ala 
180 185 190 

Ser Asn Thr Gly Glu Gly Ser He Asn Thr Thr Leu Thr Phe Asp Asp 
195 200 205 
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Thr Asp Lys Val 
225 



• Thr Asp Glu Asn Arg Gin Asp Pro Thr Val Thr Val 
215 220 

Asn Gly Tyr Ser Leu lie Asn Asn Gly Lys lie Gly 
230 235 240 

Phe Val Asn Ser Glu Leu Arg Arg Ser Asp Met Phe Asp Lys Asn Asn 
245 250 255 

■ Gin Ala Lys Gly Asn Val Ala Ala Leu Gly Arg Val 
265 270 



Ser Thr Asp His Gly Asn Phe Asn Gly He Ser Lys 
2B0 285 

Lys Pro Asp Ser Glu Leu He He Asn Phe Thr Thr 
295 300 

Ser Lys Gin Gly Ala Thr Asn Leu Val He Lys Asp 
310 315 320 

Ala Lys Lys Asn Thr Glu Leu Ala Thr Val Asn Val Ala Lys Thr Gly 
325 330 335 

. Phe Lys Val Pro Thr Asp Ala Asp Arg Leu Asp Leu 
345 350 



Asn Ala Asn Asp 
275 



Thr Val Asn Val 
290 



Met Gin Thr Asn 

305 



Gin Phe He Pro 
355 



Asp Asn Thr Ala Val Ala Asp Ala Ser Arg He Thr 
360 365 

> Gly Tyr Lys Tyr Tyr Ser Phe He Asp Asn Val Gly 
375 380 

Ser His Leu Tyr val Lys Asn Arg Asp Leu Ala Pro 
390 395 400 

Lys Ala Thr Asn Asn Lys Glu Tyr Thr He Asn Thr Glu He Gly Asn 
405 410 415 

s Gly Ala Ser Leu Lys Ala Asp Gin Phe Lys Tyr Glu 
425 430 



Thr Phe Pro Asn 
450 



Val Asn Tyr Asp 
465 



Gly Asn Glu Asp Ser Thr Val Leu Lys Asn Met Thr 
455 460 

Gin Asn Ala Asn Lys val Thr Phe Thr Ser Gin Gly 
470 475 480 

Val Thr Thr Ala Arg Gly Thr His Thr Lys Glu Val Leu Phe Pro Asp 
485 490 495 

i Leu Ser Tyr Lys Val Asn Val Ala Asn He Asp Thr 
505 510 

: Asp Phe Asn Glu Lys Leu Thr Tyr Arg Thr Ala Ser 
520 525 
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Asp Val Val lie Asn Asn Ala Gin Pro Glu Val His 
530 535 540 

(2) INFORMATION FOR SEQ ID NO: 524 3 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 274 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 3: 

lie Thr Leu Lys Thr Val Ser Gin Leu lie Asp Met Lys Gin Lys Gin 
15 10 15 

Thr Lys He Ser Met Val Thr Ala Tyr Asp Phe Pro Ser Ala Lys Gin 
20 25 30 

Val Glu Ala Ala Gly He Asp Met He Leu Val Gly Asp Ser Leu Gly 
35 40 45 

Met Thr Val Leu Gly Tyr Glu Ser Thr Val Gin Val Thr Leu Ala Asp 
50 55 60 

Met He His His Gly Arg Ala Val Arg Arg Gly Ala Pro Asn Thr Phe 
65 70 75 80 

Val Val Val Asp Met Pro He Gly Ala Val Gly He Ser Met Thr Gin 
85 90 95 

Asp Leu Asn His Ala Leu Lys Leu Tyr Gin Glu Thr Asn Ala Asn Ala 
100 105 110 

He Lys Ala Glu Gly Ala His He Thr Pro Phe He Glu Lys Ala Thr 
115 120 125 

Ala He Gly He Pro Val Val Ala His Leu Gly Leu Thr Pro Gin Ser 
130 135 140 

Val Gly Val Met Gly Tyr Lys Leu Gin Gly Ala Thr Lys Glu Ala Ala 
145 ' 150 155 160 

Glu Gin Leu He Leu Asp Ala Lys Asn Val Glu Gin Ala Gly Ala Val 
165 * 170 175 

Ala Leu Val Leu Glu Ala He Pro Asn Asp Leu Ala Glu Glu lie Ser 
180 185 190 

Lys His Leu Thr He Pro Val He Gly He Gly Ala Gly Lys Gly Thr 
195 200 205 

Asp Gly Gin Val Leu Val Tyr His Asp Met Leu Asn Tyr Gly Val Glu 
210 215 220 
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His Lys Ala Lys Phe Val Lys Gin Phe Ala Asp Phe Ser Val Gly Val 
225 230 235 240 

Asp Gly Leu Lys Gin Tyr Asp Gin Glu Val Lys Ser Gly Ala Phe Pro 
5 245 250 * 255 

Ser Glu Glu Tyr Thr Tyr Lys Lys Lys lie Met Asn Glu Val Asn Asn 
260 265 270 

w Asn Asp 



(2) INFORMATION FOR SEQ ID NO: 524 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 4: 

Ser Asp Asp Trp Pro Lys Ser He Thr Ser Leu Ser He Arg Gly Val 

15 10 15 

Arg Met Lys His Gin Glu Thr Thr Ser Gin Gin Tyr Asn Phe Ser He 

20 25 " 30 

He Lys His Gly Asp He Ser Thr Pro Gin Gly Phe Thr Ala Gly Gly 

35 40 45 

Met His He Gly Leu Arg Ala Asn Lys Lys Asp Phe Gly Trp He Tyr 

50 55 60 

Ser Ser Ser Leu Ala Ser Ala Ala Ala Val Tyr Thr Leu Asn Gin Phe 



Lys Ala Ala Pro Leu He Val Thr Glu Asp Thr Leu Gin Lys Ser Lys 
85 90 95 

Gly Lys Leu Gin Ala Leu Val Val Asn Ser Ala Asn Ala Asn Ser Cys 
100 105 110 

Thr Gly Gin Gin Gly He Asp Asp Ala Arg Gin Thr Gin Thr Trp Val 
115 120 125 

Ala Gin Gin Leu Gin He Pro Ser Glu His Val Ala Val Ala Ser Thr 
130 135 140 

Gly Val He Gly Glu Tyr Leu Pro Met Asp Lys He Lys Thr Gly Thr 
145 150 155 160 

Glu His He Lys Asp Ala Asn Phe Ala Thr Pro Gly Ala Phe Asn Glu 
165 170 175 
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Ala lie Leu Thr Thr Asp Thr Cys Thr Lys His He Ala Val Ser Leu 
180 185 190 

Lys He Asp Gly Lys Thr Val Thr He Gly Gly Ser Thr Lys Gly Ser 
195 200 205 

Gly Met He His Pro Asn Met Ala Thr Met Leu Ala Phe He Thr Thr 
210 215 220 

Asp Ala Ser He Glu Ser Asn Thr Leu His Gin Leu Leu Lys Ser Ser 
225 230 235 240 

Thr Asp His Thr Phe Asn Met He Thr Val Asp Gly Asp Thr Ser Thr 
245 250 255 

Asn Asp Met Val Leu Val Met Ala Asn His Gin Val Glu His Gin He 
260 265 270 

Leu Ser Gin Asp His Pro Gin Trp Glu Thr Phe Val Asp Ala Phe Asn 
275 280 285 

Phe Val Cys Thr Phe Leu Ala Lys Ala He Ala Arg Asp Gly Glu Gly 
290 295 300 

Ala Thr Lys Leu He Ser Val Asn val Ser Gly Ala Lys Ser He Ser 
305 310 315 320 

Asp Ala Arg Lys He Gly Lys Thr He Val Ser Ser Asn Leu Val Lys 
325 330 335 

Ser Ala He Phe Gly Glu Asp Ala Asn Phe Gly Arg He He Thr Ala 
340 345 350 

He Gly Tyr Ser Gly Cys Glu He Asp Pro Asn Cys Thr Tyr Val Gin 
355 360 365 

Leu Asn Gin He Pro Val Val Asp Lys Gly Met Ala val Leu Phe Asp 
370 375 380 

Glu Gin Ala Met Ser Asn Thr Leu Thr His Glu Asn Val Thr He Asp 
385 390 395 400 

Val Gin Leu Gly Leu Gly Asn Ala Ala Ala Thr Ala Tyr Gly Cys Asp 
405 410 * 415 

Leu Ser Tyr Asp Tyr Val Arg He Asn Ala Ser Tyr Arg Thr 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 524 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5245: 

Asn Pro Ala Leu Thr Val Phe Ala Phe lie Met lie He Ser He Leu 

15 10 15 

Leu Ala Tyr Val Phe Lys Trp Leu Gly Leu Val Asp Asp Val Leu Leu 

20 25 30 

Met Val He He He Ser Thr He Ser Leu Gly Val Val Val Pro Thr 

35 40 45 

Leu Lys Glu Met Asn He Met Arg Thr Thr He Gly Gin Phe He Leu 

50 55 60 

Leu Val Ala Val Leu Ala Asp Leu Val Thr Met He Leu Leu Thr Val 



Tyr Gly Ala He Asn Gly Gin Gly Gly Ser Thr He Trp Leu He Gly 
85 90 95 

He Leu Val Val Phe Thr Ala He Ser Tyr He Leu Gly Val Gin Phe 
100 105 110 

Lys Arg Met Ser Phe Leu Gin Lys Leu Met Asp Gly Thr Thr Gin He 
115 120 125 

Gly He Arg Ala Val Phe Ala Leu He He Leu Leu Val Ala Leu Ala 
130 135 140 

Glu Gly Val Gly Ala Glu Asn He Leu Gly Ala Phe Leu Ala Gly Val 
145 150 155 160 

Val Val Ser Leu Leu Asn Pro Asp Glu Glu Met Val Glu Lys Leu Asp 
165 170 175 

Ser Phe Gly Tyr Gly Phe Phe He Pro He Phe Phe He Met Xaa Gly 
180 185 190 

Val Asp Leu Asn He Pro Ser Leu He Lys Glu Pro Lys Leu Leu He 
195 200 205 

He He Pro He Leu He Val Ala Phe He He Ser Lys Leu He Pro 
210 215 220 

Val Met Phe He Arg Arg Trp Phe Asp Met Lys Thr Thr He Ala Ser 
225 230 235 240 

Ala Phe Leu Leu Thr Ser Thr Leu Ser Leu Val He Ala Ala Ala Lys 

245 250 255 

He Ser Glu Arg Leu Asn Ala He Ser Ala Glu Thr Ser Gly He Leu 
260 265 270 

He Leu Ser Ala Val He Thr Cys Val Phe Val Pro He He Phe Lys 
275 280 285 

Lys Leu Phe Pro Val Pro Asp Glu Phe Asn Arg Lys He Glu Val Ser 
290 295 300 
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Leu lie Gly Lys Asn Gin Leu Thr lie Pro He Ala Gin Asn Leu Thr 
305 310 315 320 

Ser Gin Leu Tyr Asp Val Thr Leu Tyr Tyr Arg Lys Asp Leu Ser Asp 
325 330 335 

Arg Arg Gin Leu Ser Asp Asp He Thr Met He Glu He Ala Asp Tyr 
340 345 350 

Glu Gin Asp Val Leu Glu Arg Leu Gly Leu Phe Asp Arg Asp He Val 
355 360 365 

Val Cys Ala Thr Asn Asp Asp Asp He Asn Arg Lys Val Ala Lys Leu 
370 375 ' 380 

Ala Lys Ala His Gin Val Glu Arg Val He Cys Arg Leu Glu Ser Thr 
385 390 395 400 

Thr Asp Asp Thr Glu Leu Val Asp Ser Gly He Glu He Phe Ser Ser 
405 410 415 

Tyr Leu Ser Asn Lys He Leu Leu Lys Gly Leu He Glu Thr Pro Asn 
420 425 430 

Met Leu Asn Leu Leu Ser Asn Val Glu Thr Ser Leu Tyr Glu He Gin 
435 440 445 

Met Leu Asn Tyr Lys Tyr Glu Asn He Gin Leu Arg Asn Phe Pro Phe 
450 455 460 

Gly Gly Asp He He 
465 

(2) INFORMATION FOR SEQ ID NO:5246: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 414 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



4s (xi) SEQUENCE DESCRIPTION: SEQ II 

Ala He He Val He Leu Leu Phe 

1 5 

He Ser He He Ser He Pro Leu 

so 20 

Leu Lys Leu Ser Asp Val Ser Leu 
35 40 

5s Thr Val Ala He Gly Arg Val He 

50 55 



5 NO: 5246 : 

Leu Arg Asn He Arg Thr Thr Ala 

10 15 

Ser Leu Leu Met Ala Leu He Ala 
25 30 

Asn He Leu Thr Leu Gly Ala Leu 
45 

Asp Asp Ser He Val Val Val Glu 

60 



3250 



EP0 786 519 A2 

Asn He Tyr Arg Arg Leu Thr Asp Ser Glu Glu Gin Leu Lys Gly Glu 
65 70 75 80 

Asn Leu He He Ser Ala Thr Thr Glu Val Phe Lys Pro He Met Ser 
85 90 95 

Ser Thr Leu Val Thr He He Val Phe Leu Pro Leu Val Phe Val Ser 
100 105 110 

Gly Ser Val Gly Glu Met Phe Arg Pro Phe Ala Leu Ala He Ala Phe 
115 120 125 

Ser Leu Leu Ala Ser Leu Leu Val Ser He Thr Leu Val Pro Ala Leu 
130 135 140 

Ala Ala Thr Leu Phe Lys Lys Gly Val Lys Arg Arg Asn Lys Gin His 
145 150 155 160 

Gin Glu Gly Leu Gly Val Val Ser Thr Thr Tyr Lys Lys Val Leu His 
165 170 175 

Trp Ser Leu Asn His Lys Trp He Val He He Leu Ser Thr Leu He 
180 185 190 

Leu Val Ala Thr He Val Phe Gly Gly Pro Arg Leu Gly Thr Ser Phe 
195 200 205 

He Ser Ala Gly Asp Asp Lys Phe Leu Ala lie Thr Tyr Thr Pro Lys 
210 215 220 

Pro Gly Glu Thr Glu Gin Ala Val Leu Asn His Ala Lys Asp Val Glu 
225 230 235 240 

Lys Tyr Leu Lys Gin Lys Lys His Val Lys Thr He Gin Tyr Ser Val 
245 250 255 

Gly Gly Ser Ser Pro Val Asp Pro Thr Gly Ser Thr Asn Ser Met Ala 
260 265 270 

He Met Val Glu Tyr Asp Asn Asp Thr Pro Asn Phe Asp Val Glu Ala 
275 280 285 

Asp Lys Val He Lys His Ala Asp Gly Phe Lys His Pro Gly Glu Trp 
290 295 300 

Lys Asn Gin Asp Leu Gly Thr Gly Ala Gly Asn Lys Ser Val Glu Val 
305 310 315 320 

Thr Val Lys Gly Pro Ser Met Asp Ala lie Lys Ser Thr Val Lys Asp 
325 330 335 

He Glu Gin Lys Met Lys Gin Val Lys Gly Leu Ala Asn Val Lys Ser 
340 345 350 

Asp Leu Ser Gin Thr Tyr Asp Gin Tyr Glu He Lys Val Asp Gin Asn 
355 360 365 

Lys Ala Ala Glu Asn Gly He Ser Ala Ser Gin Leu Ala Met His Leu 
370 375 380 
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Asn Glu Asn Leu Pro Glu Lys Thr Val Thr Thr Val Lys Glu Asn Gly 
385 390 395 400 

Lys Thr Val Asp Val Lys Val Lys Gin Asn Lys Gin Thr Ala 
405 410 

(2) INFORMATION FOR SEQ ID NO: 524 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 555 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5247: 

Gly Lys Pro Phe lie lie Gly Leu Gly Asp lie He Val Lys Lys Leu 
1 5 10 15 

Thr Thr He Leu Phe Gin Tyr Lys He Phe Pro Val Leu Met Phe Leu 
20 25 30 

Val Ser Thr Gly Leu Gly He He Val He Thr Gin Asn He Leu He 
35 " 40 45 

Ala Asp Phe Leu Ala Lys He He Arg His Gin Phe Gin Gly Leu Trp 
50 55 60 

He Val Leu Phe He Leu Leu Gly Val Leu Leu Leu Arg Ala Thr Val 
65 70 75 80 

Gin Phe Leu Asn Gin Trp Leu Gly Asp Thr Leu Ala Phe Lys Val Lys 
85 90 95 

His Met Leu Arg Gin Arg Val He Tyr Lys Asn Asn Gly His Pro He 
100 105 110 

Gly Glu Gin Met Thr He Leu Thr Glu Asn He Asp Gly Leu Ala Pro 
115 120 125 

Phe Tyr Lys Ser Tyr Leu Pro Gin Val Phe Lys Ser Met Met Val Pro 
130 135 140 

Leu He He He He Ala Met Phe Phe He His Phe Asn Thr Ala Leu 
145 150 155 160 

He Met Leu He Thr Ala Pro Phe He Pro Leu Phe Tyr He He Phe 
165 170 175 

Gly Leu Lys Thr Arg Asp Glu Ser Lys Asp Gin Met Thr Tyr Leu Asn 
180 185 190 

Gin Phe Ser Gin Arg Phe Leu Asn He Ala Lys Gly Leu Val Thr Leu 
195 200 205 
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Ser Thr Gin Phe Arg Thr Leu Thr Met Arg lie Leu Arg Ser Ala Phe 
225 230 235 240 

Leu Ser Gly Leu Met Leu Glu Phe He Ser Met Leu Gly He Gly Leu 
245 250 255 

Val Ala Leu Glu Ala Thr Leu Ser Leu Val Val Phe His Asn He Asp 
260 265 270 

Phe Lys Thr Ala Ala He Ala He He Leu Ala Pro Glu Phe Tyr Asn 
275 280 285 

Ala He Lys Asp Leu Gly Gin Ala Phe His Thr Gly Lys Gin Ser Glu 
290 295 300 

Gly Ala Ser Asp Val Val Phe Glu Phe Leu Glu Gin Pro Asn Tyr Asn 
305 310 315 320 

Asn Glu Phe Leu Leu Lys Tyr Glu Glu Asn Gin Lys Pro Phe He Gin 
325 330 335 

Leu Thr Asp He Ser Phe Arg Tyr Asp Asp Ser Asp Arg Leu Val Leu 
340 345 350 

Asn Asp Leu Asn Leu Glu He Phe Lys Gly Asp Gin He Ala Leu Val 
355 360 365 

Gly Pro Ser Gly Ala Gly Lys Ser Thr Leu Thr His Leu He Ala Gly 
370 375 380 

Val Tyr Gin Pro Thr He Gly Thr He Ser Thr Asn Gin Arg Asp Leu 
385 390 395 400 

Asn He Gly He Leu Ser Gin Gin Pro Tyr He Phe Ser Ala Ser He 
405 410 415 

Lys Glu Asn He Thr Met Phe Lys Asp He Glu Asn Asn Thr He Glu 
420 425 430 

Glu Val Leu Asp Glu Val Gly Leu Leu Asp Lys Val Gin Ser Phe Thr 
435 440 445 



Gly Gin Met Arg Arg He Glu Leu Cys Arg Leu Leu Val Met Lys Pro 
465 470 475 480 

Asp Leu Val He Phe Asp Glu Pro Ala Thr Gly Leu Asp He Gin Thr 
485 490 495 

Glu His Met He Gin Asn Val Leu Phe Gin His Phe Lys Asp Thr Thr 
500 505 510 

Met He Val He Ala His Arg Asp Asn Thr He Arg His Leu Gin Arg 
515 520 525 
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Arg Leu Tyr lie Glu Asn Gly Arg Leu He Ala Asp Asp Arg Asn He 

530 535 540 

Ser Val Asn He Thr Glu Asn Gly Asp Asp Leu 

545 550 555 

(2) INFORMATION FOR SEQ ID NO:5248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5248: 

Val Trp Lys Leu Lys Met Arg Trp He Lys Arg Lys Lys Lys Asn Phe 
1 5 10 15 

Leu Asn ser Lys Phe Asn Phe Asn Asn Gly Lys He Ala Thr Tyr Leu 
20 25 30 

Tyr Lys Glu Arg Thr Ala Met Trp Asn Lys Asn Arg Leu Thr Gin Met 
35 40 45 

Leu Ser lie Glu Tyr Pro He He Gin Ala Gly Met Ala Gly Ser Thr 
50 55 60 

Thr Pro Lys Leu Val Ala Ser Val Ser Asn Ser Gly Gly Leu Gly Thr 
65 70 75 80 

lie Gly Ala Gly Tyr Phe Asn Thr Gin Gin Leu Glu Asp Glu He Asp 
85 90 95 

Tyr Val Arg Gin Leu Thr Ser Asn Ser Phe Gly Val Asn Val Phe Val 
100 105 110 

Pro Ser Gin Gin Ser Tyr Thr Ser Ser Gin He Glu Asn Met Asn Ala 

115 120 125 

Trp Leu Lys Pro Tyr Arg Arg Ala Leu His Leu Glu Glu Pro Val Val 
130 135 140 

Lys He Thr Glu Glu Gin Gin Phe Lys Cys His lie Asp Thr He He 
145 150 155 160 

Lys Lys Gin Val Pro val Cys Cys Phe Thr Phe Gly He Pro Ser Glu 
165 170 175 

Gin He He Ser Arg Leu Lys Ala Ala Asn Val Lys Leu He Gly Thr 
180 185 190 

Ala Thr Ser Val Asp Glu Ala He Ala Asn Glu Lys Ala Gly Met Asp 
195 200 205 
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Ala lie Val Ala Gin Gly Ser Glu Ala Gly Gly His Arg Gly Ser Phe 
210 215 220 

Leu Lys Pro Lys Asn Gin Leu Pro Met Val Gly Thr lie Ser Leu Val 
225 230 235 240 

Pro Gin He Val Asp Val Val Ser He Pro Val He Ala Ala Gly Gly 
245 250 255 

He Met Asp Gly Arg Gly Val Leu Ala Ser He Val Leu Gly Ala Glu 
260 265 270 

Gly Val Gin Met Gly Thr Ala Phe Leu Thr Ser Gin Asp Ser Asn Ala 
275 280 285 

Ser Glu Leu Leu Arg Asp Ala He He Asn Ser Lys Glu Thr Asp Thr 
290 295 300 

Val He Thr Lys Ala Phe Ser Gly Lys Leu Ala Arg Gly He Asn Asn 
305 310 315 320 

Arg Phe He Glu Glu Met Ser Gin Tyr Glu Gly Asp He Pro Asp Tyr 
325 330 335 

Pro He Gin Asn Glu Leu Thr Ser Ser He Arg Lys Ala Ala Ala Asn 
340 345 350 

He Gly Asp Lys Glu Leu He His Met Trp Ser Gly Gin Ser Pro Arg 
355 360 ~ 365 

Leu Ala Thr Thr His Pro Ala Asn Thr He Met Ser Asn He He Asn 
370 375 380 

Gin He Asn Gin He Met Gin Tyr Lys 
385 390 

(2) INFORMATION FOR SEQ ID NO: 524 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 936 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 9: 

Asn Asp Phe Leu Lys Arg Gly Asn Lys Met Asn Met Lys Lys Lys Glu 
15 10 15 

Lys His Ala He Arg Lys Lys Ser He Gly Val Ala Ser Val Leu Val 
20 25 30 

Gly Thr Leu He Gly Phe Gly Leu Leu Ser Ser Lys Glu Ala Asp Ala 
35 40 45 
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Ser Glu Asn Ser Val Thr Gin Ser Asp Ser Ala Ser Asn Glu Ser Lys 
50 55 60 

Ser Asn Asp Ser Ser Ser Val Ser Ala Ala Pro Lys Thr Asp Asp Thr 
65 70 75 80 

Asn Val Ser Asp Thr Lys Thr Ser Ser Asn Thr Asn Asn Gly Glu Thr 
85 90 95 

Ser Val Ala Gin Asn Pro Ala Gin Gin Glu Thr Thr Gin Ser Ser ser 
100 105 110 

Thr Asn Ala Thr Thr Glu Glu Thr Pro Val Thr Gly Glu Ala Thr Thr 
115 120 125 

Thr Thr Thr Asn Gin Ala Asn Thr Pro Ala Thr Thr Gin Ser Ser Asn 
130 135 140 

Thr Asn Ala Glu Glu Leu Val Asn Gin Thr Ser Asn Glu Thr Thr Ser 
145 150 155 160 

Asn Asp Thr Asn Thr Val Ser Ser Val Asn Ser Pro Gin Asn Ser Thr 
165 170 175 

Asn Ala Glu Asn Val Ser Thr Thr Gin Asp Thr Ser Thr Glu Ala Thr 
180 185 190 

Pro Ser Asn Asn Glu Ser Ala Pro Gin Ser Thr Asp Ala Ser Asn Lys 
195 200 205 

Asp Val Val Asn Gin Ala Val Asn Thr Ser Ala Pro Arg Met Arg Ala 

210 215 220 



lie Thr Asn Gin Leu Thr Asn Val Thr Val Gly lie Asp Ser Gly Thr 
245 250 255 

Thr Val Tyr Pro His Gin Ala Gly Tyr Val Lys Leu Asn Tyr Gly Phe 

260 265 270 

Ser Val Pro Asn Ser Ala Val Lys Gly Asp Thr Phe Lys lie Thr Val 
275 280 285 



Pro He Met Ala Gly Asp Gin Val Leu Ala Asn Gly Val He Asp Ser 
305 310 315 320 

Asp Gly Asn Val He Tyr Thr Phe Thr Asp Tyr Val Asn Thr Lys Asp 
325 330 335 

Asp Val Lys Ala Thr Leu Thr Met Pro Ala Tyr He Asp Pro Glu Asn 
340 345 350 

Val Lys Lys Thr Gly Asn Val Thr Leu Ala Thr Gly He Gly Ser Thr 
355 360 365 
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Thr Ala Asn Lys ' 

370 



Tyr Asn Leu ! 
385 



Asn Thr Tyr Arg < 



Leu Val Asp Tyr Glu Lys Tyr Gly 1 
375 380 



i Giy Thr He Asp Gin He Asp Lys 
i 395 



He Tyr Val Asn Pro Ser Gly Asp 
410 



Thr Asn 
400 



Asn Val 
415 



lie Ala Pro Val 
420 



Ala Leu He Asp 
435 



Asn Ala Ala Asp 
450 



Glu Asp Val Thr , 
465 



Tyr Lys val Glu 



Leu Thr 
Gin Gin 
Leu Ser 



Gly Asn Leu Lys Pro Asn Thr Asp Ser Asn 
425 430 



Asn Thr Ser He Lys val Tyr Lys 
440 445 



Glu Ser Tyr Phe Val Asn Pro Glu 
455 460 



i Thr Pro Asp Asp Gin He Thr Thr i 
490 



Ala Leu Arg Ser 
515 



Asn Gly 
Thr Leu 
Asp Asn 



Gly Asp Gly He j 
545 



Gly Glu He Glu Pro He 
565 



Asp Ser Gly Ser 
580 



Asp Ser Thr Ser 
595 



Ala Ser Asp ser 
610 



Asp Ser 
Asp Ser 
Asp Ser 



His He Asp Pro Asn Ser Lys Gly 

505 510 



Tyr Gly Tyr Asn Ser Asn He He 
520 525 



Glu Val Ala Phe Asn Asn Gly Ser 
535 540 



Pro Glu Asp Ser Asp Ser Asp Pro 

570 



Asn Ser Asp Ser Gly Ser Asp Ser 
585 590 



Gly Ser Asp Ser Ala Ser Asp Ser 
600 605 



Ala Ser Asp Ser Asp Ser Ala Ser 
615 620 



Val Asp 

Asn Phe 

Asn Gin 
480 

Pro Tyr 
495 

Asp Leu 

Trp Arg 

Gly Ser 

Glu Pro 
560 

Gly Ser 
575 

Gly Ser 
Asp Ser 
Asp Ser 



■ Asp Ser Asp Asn Asp Ser Asp Ser i 
I 635 



Asp Ser Asp Ser 1 



■ Asp Ser Asp Ser Asp Ser Asp Ser 

650 



Asp Ser 

655 



• Asp Ser j 

I 

■ Asp Ser i 



Asp Ser 
Asp Ser 
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Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser 

690 695 700 

Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser 
705 710 715 720 

Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser 
725 730 735 

Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser 
740 745 750 

Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser 
755 760 765 

Asp Ser Asp Ser Asp Ser Asp Ser Ala Ser Asp Ser Asp Ser Asp Ser 
770 775 780 

Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser 
785 790 795 800 

Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser 

805 810 815 

Glu Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser Asp Ser 

820 825 830 

Asp Ser Asp Ser Asp Ser Asp Ser Ala Ser Asp Ser Asp Ser Gly Ser 
835 840 845 

Asp Ser Asp Ser Ser Ser Asp Ser Asp Ser Glu Ser Asp Ser Asn Ser 

850 855 860 

Asp Ser Glu Ser Val Ser Asn Asn Asn Val Val Pro Pro Asn Ser Pro 
865 870 875 880 

Lys Asn Gly Thr Asn Ala Ser Asn Lys Asn Glu Ala Lys Asp Ser Lys 
885 890 895 

Glu Pro Leu Pro Asp Thr Gly Ser Glu Asp Glu Ala Asn Thr Ser Leu 
900 905 910 

lie Trp Gly Leu Leu Ala Ser lie Gly Ser Leu Leu Leu Phe Arg Arg 
915 920 925 

Lys Lys Glu Asn Lys Asp Lys Lys 
930 935 

(2) INFORMATION FOR SEQ ID NO: 5250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5250: 

Val Ser Lys Leu Lys Lys Glu lie Leu Glu Trp lie He Ser He Ala 

15 10 15 

Val Ala Phe Val He Leu Phe He Val Gly Lys Phe He Val Thr Pro 
20 25 30 

Tyr Thr He Lys Gly Glu Ser Met Asp Pro Thr Leu Lys Asp Gly Glu 
35 40 45 

Arg val Ala val Asn He val Gly Tyr Lys Thr Gly Gly Leu Glu Lys 
50 55 60 

Gly Asn Val val Val Phe His Ala Asn Lys Asn Asp Asp Tyr Val Lys 
65 70 75 80 

Arg Val He Gly Val Pro Gly Asp Lys Val Glu Tyr Lys Asn Asp Thr 
85 90 95 

Leu Tyr Val Asn Gly Lys Lys Gin Asp Glu Pro Tyr Leu Asn Tyr Asn 
100 105 110 

Leu Lys His Lys Gin Gly Asp Tyr He Thr Gly Thr Phe Gin Val Lys 
115 120 125 

Asp Leu Pro Asn Ala Asn Pro Lys Ser Asn Val He Pro Lys Gly Lys 
130 135 140 

Tyr Leu val Leu Gly Asp Asn Arg Glu Val Ser Lys Asp Ser Arg Ala 
145 150 155 160 

Phe Gly Leu He Asp Glu Asp Gin He Val Gly Lys Val Ser Phe Gin 
165 170 175 

Val Leu Ala His Phe Ser Glu Phe Gin Thr Ser He Ser Xaa Leu Lys 
180 185 190 



(2) INFORMATION FOR SEQ ID NO: 5251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5251: 

Leu Lys Ala Xaa Tyr Ala Lys Leu Asp Asp Val Ser Lys Phe Glu Asp 
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Val Thr Asp Asn Met Ser Leu Asp Phe Asp Thr Asn Gly Gly Tyr Ser 
20 25 30 

Leu Asn Phe Asn Asn Leu Asp Gin Ser Lys Asn Tyr Val lie Lys Tyr 
35 40 45 

Glu Gly Tyr Tyr Asp Ser Asn Ala Ser Asn Leu Glu Phe Gin Thr His 
50 55 60 

Leu Phe Gly Tyr Tyr Asn Tyr Tyr Tyr Thr Ser Asn Leu Thr Trp Lys 
65 70 75 80 

Asn Gly Val Ala Phe Tyr Ser Asn Asn Ala Gin Gly Asp Gly Lys Asp 
85 90 95 

Lys Leu Lys Glu Pro lie lie Glu His Ser Thr Pro lie Glu Leu Glu 
100 105 110 

Phe Lys Ser Glu Pro Pro val Glu Lys His Glu Leu Thr Gly Thr lie 
115 120 125 

Glu Glu Ser Asn Asp Ser Lys Pro He Asp Phe Glu Tyr His Thr Ala 
130 135 140 

Val Glu Gly Ala Glu Gly His Ala Glu Gly Thr He Glu Thr Glu Glu 
145 150 155 160 

Asp Ser He His Val Asp Phe Glu Glu Ser Thr His Glu Asn Ser Lys 
165 170 175 

His His Ala Asp Val Val Glu Tyr Glu Glu Asp Thr Asn Pro Gly Gly 
180 185 190 

Gly Gin Val Thr Thr Glu Ser Asn Leu Val Glu Phe Asp Glu Asp Ser 
195 200 205 

Thr Lys Gly He val Thr Gly Ala Val Ser Asp His Thr Thr He Glu 
210 215 220 

Asp Thr Lys Glu Tyr Thr Thr Glu Ser Asn Leu He Glu Leu Val Asp 
225 230 235 240 

Glu Leu Pro Glu Glu His Gly Gin Ala Gin Gly Pro He Glu Glu He 
245 250 255 

Thr Glu Asn Asn His His He Ser His Ser Gly Leu Gly Thr Glu Asn 
260 265 * 270 

Gly His Gly Asn Tyr Gly Val He Glu Glu He Glu Glu Asn Ser His 
275 280 285 

Val Asp He Lys Ser Glu Leu Gly Tyr Glu Gly Gly Gin Asn Ser Gly 
290 295 300 

Asn Gin Ser Phe Glu Glu Asp Thr Glu Glu Asp Lys Pro Lys Tyr Glu 
305 310 315 320 

Gin Gly Gly Asn He Val Asp He Asp Phe Asp Ser Val Pro Gin He 
325 330 335 
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His Gly Gin Asn Asn Gly Asn Gin Ser Phe Glu Glu Asp Thr Glu Lys 
340 345 350 

Asp Lys Pro Lys Tyr Glu Gin Gly Gly Asn lie lie Asp lie Asp Phe 
355 360 365 

Asp Ser Val Pro His He His Gly Phe Asn Lys His Thr Glu He He 
370 375 380 

Glu Glu Asp Thr Asn Lys Asp Lys Pro Asn Tyr Gin Phe Gly Gly His 
385 390 395 400 

Asn Ser Val Asp Phe Glu Glu Asp Thr Leu Pro Gin Val Ser Gly His 
405 410 415 

Asn Glu Gly Gin Gin Thr He Glu Glu Asp Thr Thr Pro Pro He Val 
420 425 430 

Pro Pro Thr Pro Pro Thr Pro Glu Val Pro Ser Glu Pro Glu Thr Pro 
435 440 445 

Thr Pro Pro Thr Pro Glu Val Pro Ser Glu Pro Glu Thr Pro Thr Pro 
450 455 460 

Pro Thr Pro Glu Val Pro Thr Glu Pro Gly Lys Pro He Pro Pro Ala 
465 470 475 480 

Lys Glu Glu Pro Lys Lys Pro Ser Lys Pro val Glu Gin Gly Lys val 

485 490 495 

Val Thr Pro Val He Glu He Asn Glu Lys Val Lys Ala Val Val Pro 

500 505 510 

Thr Lys Lys Ala Gin Ser Lys Lys Ser Glu Leu Pro Glu Thr Gly Gly 
515 520 525 

Glu Glu Ser Thr Asn Asn Gly Met Leu Phe Gly Gly Leu Phe Ser He 
530 535 540 

Leu Gly Leu Ala Leu Leu Arg Arg Asn Lys Lys Asn His Lys Ala 

545 550 555 

(2) INFORMATION FOR SEQ ID NO: 52 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5252: 

Thr Lys Asn Glu Lys lie Asn Asp Val Thr Ala Val Ala Glu Lys Glu 
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Val Val Glu Glu Thr Lys Ala Thr Gly Thr Asp Val Thr Asn Lys Val 

20 25 30 

Glu Val Glu Glu Gly Ser Glu He Val Gly His Lys Gin Asp Thr Asn 
35 40 45 

Val Val Asn Pro His Asn Ala Glu Arg Val Thr Leu Lys Tyr Lys Trp 

50 55 60 

Lys Phe Gly Glu Gly He Lys Ala Gly Asp Tyr Phe Asp Phe Thr Leu 
65 70 75 80 

Ser Asp Asn Val Glu Thr His Gly He Ser Thr Leu Arg Lys Val Pro 
85 90 95 

Glu He Lys Ser Thr Asp Gly Gin Val Met Ala Thr Gly Glu He He 

100 105 110 

Gly Glu Arg Lys Val Arg Tyr Thr Phe Lys Glu Tyr Val Gin Glu Lys 
115 120 125 

Lys Asp Leu Thr Ala Glu Leu Ser Leu Asn Leu Phe He Asp Pro Thr 
130 135 140 

Thr Val Thr Gin Lys Gly Asn Gin Asn Val Glu Val Lys Leu Gly Glu 
145 150 155 160 

Thr Thr Val Ser Lys He Phe Asn He Gin Tyr Leu Gly Gly Val Arg 
165 170 175 

Asp Asn Trp Gly Val Thr Ala Asn Gly Arg He Asp Thr Leu Asn Lys 
180 185 190 

Val Asp Gly Lys Phe Ser His Phe Ala Tyr Met Lys Pro Asn Asn Gin 
195 200 205 

Ser Leu Ser Ser Val Thr Val Thr Gly Gin Val Thr Lys Gly Asn Lys 
210 215 220 

Pro Gly Val Asn Asn Pro Thr Val Lys Val Tyr Lys His He Gly Ser 
225 230 235 240 

Asp Asp Leu Ala Glu Ser Xaa Xaa Cys Lys Ala 
245 250 

(2) INFORMATION FOR SEQ ID NO: 5253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:5253: 
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lie Leu His Leu Arg Glu Asn lie lie Val Lys Ser Asn Leu Arg Tyr 
1 5 10 15 

Gly He Arg Lys His Lys Leu Gly Ala Ala Ser Val Phe Leu Gly Thr 
20 25 30 

Met He Val Val Gly Met Gly Gin Glu Lys Glu Ala Ala Ala Ser Glu 
35 40 45 

Gin Asn Asn Thr Thr Val Glu Glu Ser Gly Ser Ser Ala Thr Glu Ser 
50 55 60 

Lys Ala Ser Glu Thr Gin Thr Thr Thr Asn Asn Val Asn Thr He Asp 
65 70 75 80 

Glu Thr Gin Ser Tyr Ser Ala Thr Ser Thr Glu Gin Pro Ser Gin Ser 
85 90 95 

Thr Gin Val Thr Thr Glu Glu Ala Pro Lys Thr Val Gin Ala Pro Lys 
100 105 110 

Val Glu Thr Ser Arg Val Asp Leu Pro Ser Glu Lys Val Ala Asp Lys 

115 120 125 

Glu Thr Thr Gly Thr Gin Val Asp He Ala Gin Pro Ser Asn Val Ser 
130 135 140 

Glu He Lys Pro Arg Met Lys Arg Ser Met Thr Leu Gin Gin Leu Gin 
145 150 155 160 

Arg Lys Lys 

(2) INFORMATION FOR SEQ ID NO: 5254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5254: 

He Leu His Leu Lys Gly Asp He He Val Lys Asn Asn Leu Arg Tyr 
15 10 15 

Gly He Arg Lys His Lys Leu Gly Ala Ala Ser Val Phe Leu Gly Thr 
20 25 30 

Met He Val Val Gly Met Gly Gin Asp Lys Glu Ala Ala Ala Ser Glu 
35 40 45 

Gin Lys Thr Thr Thr Val Glu Glu Asn Gly Asn Ser Ala Thr Asp Asn 

50 55 60 
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Lys Thr Ser Glu Thr Gin Thr Thr Ala Thr Asn Val Asn His He Glu 
65 70 75 80 

Glu Thr Gin Ser Tyr Asn Ala Thr Val Thr Glu Gin Pro Ser Asn Ala 
85 90 95 

Thr Gin Val Thr Thr Glu Glu Ala Pro Lys Ala Val Gin Ala Pro Gin 
100 105 110 

Thr Ala Gin Pro Ala Asn He Glu Thr Val Lys Glu Glu Val Val Lys 
115 120 125 

Glu Glu Ala Lys Pro Gin Val Lys Glu Thr Thr Gin Ser Gin Asp Asn 
130 135 140 

Ser Gly Asp Gin Arg Gin Val Asp Leu Thr Pro Lys Lys Ala Thr Gin 
145 150 155 160 

Asn Gin val Ala Glu Thr Gin val Glu Val Ala Gin Pro Arg Thr Ala 
165 170 175 

Ser Glu Ser Lys Pro Arg Val Thr Arg Ser Ala Asp Val Ala Glu Ala 
180 185 190 

Lys Glu Ala Ser Asn Ala Lys Val Glu Thr Gly Thr Asp Val Thr Ser 
195 200 205 

Lys val Thr Val Glu He Gly Ser He Glu Gly His Asn Asn Thr Asn 
210 215 220 

Lys Val Glu Pro His Ala Gly Gin Arg Ala Val Leu Lys Tyr Lys Leu 
225 230 235 240 

Lys Phe Glu Asn Gly Leu His Gin Gly Asp Tyr Phe Asp Phe Thr Leu 
245 250 255 

Ser Asn Asn Val Asn Thr His Gly Val Ser Thr Ala Arg Lys Val Pro 

260 265 270 

Glu He Lys Asn Gly Ser Val Val Met Ala Thr Gly Glu Val Leu Glu 
275 280 285 

Gly Gly Lys He Arg Tyr Thr Phe Thr Asn Asp He Glu Asp Lys Val 
290 295 300 

Asp Val Thr Ala Glu Leu Glu He Asn Leu Phe He Asp Pro Lys Thr 
305 310 315 " 320 

Val Gin Thr Asn Gly Asn Gin Thr He Thr Ser Thr Leu Asn Glu Glu 
325 330 335 

Gin Thr Ser Lys Glu Leu Asp Val Lys Tyr Lys Asp Gly He Gly Asn 
340 345 350 

Tyr Tyr Ala Asn Leu Asn Gly Ser He Glu Thr Phe Asn Lys Ala Asn 
355 360 365 

Asn Arg Phe Ser His Val Ala Phe He Lys Pro Asn Asn Gly Lys Thr 
370 375 380 
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Thr Ser Val Thr Val Thr Gly Thr Leu Met Lys Gly Ser Asn Gin Asn 
385 390 395 400 

Gly Asn Gin Pro Lys Val Arg lie Phe Glu Tyr Leu Gly Asn Asn Glu 
405 410 415 

Asp He Ala Lys Ser Val Tyr Ala Asn Thr Thr Asp Thr Ser Lys Phe 
420 425 430 

Lys Glu Val Thr Ser Asn Met Ser Gly Asn Leu Asn Leu Gin Asn Asn 
435 440 445 

Gly Ser Tyr Ser Leu Asn He Glu Asn Leu Asp Lys Thr Tyr Val Val 
450 455 460 

His Tyr Asp Gly Glu Tyr Leu Asn Gly Thr Asp Glu Val Asp Phe Arg 
465 470 475 480 

Thr Gin Met Val Gly His Pro Glu Gin Leu Tyr Lys Tyr Tyr Tyr Asp 
485 490 495 

Arg Gly Tyr Thr Leu Thr Trp Asp Asn Gly Leu Val Leu Tyr Ser Asn 
500 505 510 

Lys Ala Asn Gly Asn Glu Lys Asn Gly Pro He He Gin Asn Asn Lys 
515 520 525 

Phe Glu Tyr Lys Glu Asp Thr He Lys Glu Thr Leu Thr Gly Gin Tyr 
530 535 540 

Asp Lys Asn Leu Val Thr Thr Val Glu Glu Glu Tyr Asp Ser Ser Thr 
545 550 555 560 

Leu Asp He Asp Tyr His Thr Ala He Asp Gly Gly Gly Gly Tyr Val 
565 570 575 

Asp Gly Tyr He Glu Thr He Glu Glu Thr Asp Ser Ser Ala He Asp 
580 585 590 

He Asp Tyr His Thr Ala Val Asp Ser Glu Ala Gly His Val Gly Gly 
595 600 605 

Tyr Thr Glu Ser Ser Glu Glu Ser Asn Pro He Asp Phe Glu Glu Ser 
610 615 620 

Thr His Glu Asn Ser Lys His His Ala Asp Val Val Glu Tyr Glu Glu 
625 630 635 640 

Asp Thr Asn Pro Gly Gly Gly Gin Val Thr Thr Glu Ser Asn Leu Val 
645 650 655 

Glu Phe Asp Glu Glu Ser Thr Lys Gly He Val Thr Gly Ala Val Ser 
660 665 670 

Asp His Thr Thr Val Glu Asp Thr Lys Glu Tyr Thr Thr Glu Ser Asn 
675 680 685 

Leu lie Glu Leu Val Asp Glu Leu Pro Glu Glu His Gly Gin Ala Gin 
690 695 700 
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Gly Pro Val Glu Glu He Thr Lys Asn Asn His His He Ser His Ser 
705 710 715 720 

Gly Leu Gly Thr Glu Asn Gly His Gly Asn Tyr Asp Val He Glu Glu 
725 * 730 * 735 

He Glu Glu Asn Ser His Val Asp He Lys Ser Glu Leu Gly Tyr Glu 
740 745 750 

Gly Gly Gin Asn Ser Gly Asn Gin Ser Phe Glu Glu Asp Thr Glu Glu 
755 760 765 

Asp Lys Pro Lys Tyr Glu Gin Gly Gly Asn He Val Asp He Asp Phe 
770 775 780 

Asp Ser Val Pro Gin He His Gly Gin Asn Lys Gly Asn Gin Ser Phe 
785 790 795 800 

Glu Glu Asp Thr Glu Lys Asp Lys Pro Lys Tyr Glu His Gly Gly Asn 
805 810 815 

He He Asp He Asp Phe Asp Ser Val Pro His He His Gly Phe Asn 

820 825 830 

Lys His Thr Glu He He Glu Glu Asp Thr Asn Lys Asp Lys Pro Ser 
835 840 845 

Tyr Gin Phe Gly Gly His Asn Ser Val Asp Phe Glu Glu Asp Thr Leu 
850 855 860 

Pro Lys Val Ser Gly Gin Asn Glu Gly Gin Gin Thr He Glu Glu Asp 
865 870 875 880 

Thr Thr Pro Pro He Val Pro Pro Thr Pro Pro Thr Pro Glu Val Pro 
885 890 895 

Ser Glu Pro Glu Thr Pro Thr Pro Pro Thr Pro Glu Val Pro Ser Glu 
900 905 910 

Pro Glu Thr Pro Thr Pro Pro Thr Pro Glu Val Pro Ser Glu Pro Glu 
915 920 925 

Thr Pro Thr Pro Pro Thr Pro Glu Val Pro Ala Glu Pro Gly Lys Pro 
930 935 940 

Val Pro Pro Ala Lys Glu Glu Pro Lys Lys Pro Ser Lys Pro Val Glu 
945 950 955 960 

Gin Gly Lys Val Val Thr Pro Val He Glu He Asn Glu Lys Val Lys 
965 970 975 

Ala Val Ala Pro Thr Lys Lys Pro Gin Ser Lys Lys Ser Glu Leu Pro 
980 985 990 

Glu Thr Gly Gly Glu Glu Ser Thr Asn Lys Gly Met Leu Phe Gly Gly 
995 1000 1005 

Leu Phe Ser He Leu Gly Leu Ala Leu Leu Arg Arg Asn Lys Lys Asn 
1010 1015 1020 
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His Lys Ala 
1025 

(2) INFORMATION FOR SEQ ID NO: 5255: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5255: 

Gly Glu Lys Cys Met Phe Leu Ala Trp Asn Glu lie Arg Arg Asn Lys 
1 " 5 10 15 

Leu Lys Phe Gly Leu He He Gly Val Leu Thr Met He Ser Tyr Leu 
20 25 30 

Leu Phe Leu Leu Ser Gly Leu Ala Asn Gly Leu He Asn Met Asn Lys 
35 40 45 

Glu Gly He Asp Lys Trp Gin Ala Asp Ala He Val Leu Asn Lys Asp 
50 * 55 60 

Ala Asn Gin Thr Val Gin Gin Ser Val Phe Asn Lys Lys Asp He Glu 
65 70 75 80 

Asn Lys Tyr Lys Lys Gin Ala Thr Leu Lys Gin Thr Gly Glu He Val 
85 90 95 

Ser Asn Gly His Gin Lys Asp Asn Val Leu Val Phe Gly Val Glu Lys 
100 105 110 

Ser Ser Phe Leu Val Pro Ser Leu He Glu Gly His Lys Ala Thr Lys 
115 120 125 

Asp Asn Glu Val Leu Ala Asp Glu Thr Leu Lys Asn Lys Gly Leu Lys 
130 135 140 

Leu Gly Asp Thr Leu Ser Leu Ser Xaa Xaa Arg 
145 * 150 155 



1. Computer readable medium having recorded thereon a nucleotide sequence of the Staphylococcus aureus ge- 
nome as depicted in SEQ ID NOS1-5.1 91 , a representative fragment thereof or a nucleotide sequence at least 
95 % identical to a nucleotide sequence depicted in SEQ ID NOS:1-5,191. 

2. Computer readable medium having recorded thereon any one of the fragments of SEQ ID NOS:1-5,191 depicted 
in Tables 2 and 3 or a degenerate variant thereof. 
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3. The computer readable medium of claim 1 , wherein said medium is selected from the group consisting of a floppy 
disc, a hard disc, random access memory (RAM), read only memory (ROM) and CD-ROM. 

4. The computer readable medium of claim 3, wherein said medium is selected from the group consisting of a floppy 
disc, a hard disc, random access memory (RAM), read only memory (ROM), and CD-ROM 

5. A computer-based system for identifying fragments of the Staphylococcus aureus genome of commercial impor- 
tance comprising the following elements 

(a) a data storage means comprising the nucleotide sequence of SEQ ID NOS1-5.1 91 , a representative frag- 
ment thereof, or a nucleotide sequence at least 95% identical to a nucleotide sequence of SEQ ID NOS: 
1-5,191; 

(b) search means for comparing a target sequence to the nucleotide sequence of the data storage means of 
step (a) to identify homologous sequence(s), and 

(c) retrieval means for obtaining said homologous sequence(s) of step (b). 

6. A method for identifying commercially important nucleic acid fragments of the Staphylococcus aureus genome 
comprising the step of comparing a database comprising the nucleotide sequences depicted in SEQ ID NOS: 
1-5,191 , a representative fragment thereof, or a nucleotide sequence at least 95% identical to a nucleotide se- 
quence of SEQ ID NOS: 1-5, 191 with a target sequence to obtain a nucleic acid molecule comprised of a comple- 
mentary nucleotide sequence to said target sequence, wherein said target sequence is not randomly selected. 

7. A method for identifying an expression modulating fragment of Staphylococcus aureus genome comprising the 
step of comparing a database comprising the nucleotide sequences depicted in SEQ ID NOS: 1-5, 191, a repre- 
sentative fragment thereof, or a nucleotide sequence at least 95% identical to the nucleotide sequence of SEQ ID 
NOS:1-5,191 with a target sequence to obtain a nucleic acid molecule comprised of a complementary nucleotide 
sequence to said target sequence, wherein said target sequence comprises sequences known to regulate gene 
expression 

8. A protein-encoding nucleic acid fragment of the Staphylococcus aureus genome, 

wherein said fragment comprises the nucleotide sequence of any one of the fragments of SEQ ID NOS: 
1-5,191 depicted in Tables 2 and 3 or a nucleotide sequence at least 95% identical to such a nucleotide sequence, 
or a degenerate variant of any of the aforementioned sequences. 

9. The nucleic acid fragment of claim 8 which is DNA 

10. The nucleic acid fragment of claim 8 which is RNA 

11 . A vector comprising a fragment of claim 8. 

12. A fragment of the Staphylococcus aureus genome, wherein said fragment modulates the expression of an operably 
liked open reading frame, wherein said fragment consists of the nucleotide sequence from about 10 to 200 bases 
in length which is 5' to any one of the open reading frames depicted in Tables 2 and 3 or a nucleotide sequence 
at least 95% identical to such a nucleotide sequence or a degenerate variant of any of the aforementioned se- 
quences. 

13. A vector comprising a fragment of claim 12. 

14. A organism which has been altered to contain any one of the fragments of the Staphylococcus aureus genome of 
claim 8. 

15. A method for producing a polypeptide in a host cell comprising the steps of: 

(a) incubating an organism of claim 14 under conditions where said fragment is expressed to produce said 
protein, and 

(b) isolating said protein 

16. An organism which has been altered to contain any one of the fragments of the Staphylococcus aureus genome 
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of claim 12. 

17. A method for regulating the expression of a nucleic acid molecule comprising the step of covalently attaching to 
said nucleic acid molecule a nucleic acid molecule consisting of the nucleotide sequence from about 30 to 300 
bases 5' to any one of the fragments of the Staphylococcus aureus genome depicted in Seq ID Nos:1 -5,191 and 
Tables 2 and 3 or a nucleotide sequence at least 95% identical to such a nucleotide sequence or a degenerate 
variant of any of the aforementioned sequences 

18. A nucleic acid molecule being a homolog of any of the fragments of the Staphylococcus aureus genome of SEQ 
ID NOS: 1-5, 191 and Tables 2 and 3, wherein said nucleic acid molecule is produced by a process comprising the 
steps of: 

(a) screening a genomic DNA library using as a probe a target sequence defined by any of SEQ ID NOS: 
1-5,191 and Tables 2 and 3, including fragments thereof; 

(b) identifying members of said library which contain sequences that hybridize to said target sequence; 

(c) isolating the nucleic acid molecules from said members identified in step (b). 

19. A DNA molecule being a homolog of any one of the fragments of the Staphylococcus aureus genome of SEQ ID 
NOS: 1-5,1 91 and Tables 2 and 3, wherein said nucleic acid molecule is produced by a process comprising the 
steps of: 

(a) isolating mRNA, DNA, or cDNA produced from an organism, 

(b) amplifying nucleic acid molecules whose nucleotide sequence is homologous to amplification primers de- 
rived from said fragment of said Staphylococcus aureus genome to prime said amplification, 

(c) isolating said amplified sequences produced in step (b) 

20. A polypeptide encoded by a fragment of claim 8. 

21. An antibody which selectively binds to any one of the polypeptides of claim 20. 

22. A kit for analyzing samples for the presence of polynucleotides derived from Staphylococcus aureus, comprising 
at least one polynucleotide containing a nucleotide sequence of any one of the fragments SEQ ID NOS:1-5,191 
depicted in Tables 2 and 3 or a nucleotide sequence at least 95% identical thereto or a degenerate variant of any 
of the aforementioned sequences, that will hybridize to a staphylococcus aureus polynucleotide under stringent 
hybridization conditions, and a suitable container. 

23. A Staphylococcus aureus polypeptide comprising an amino acid sequence identical to an amino acid sequence 
selected from the group consisting of SEQ ID NOS:5,1 92 to 5,255 or comprising an amino acid sequence having 
at least 95% identity to such a sequence 

24. A Staphylococcus aureus polypeptide antigen comprising at least one epitope derived from a Staphylococcus 
aureus polypeptide selected from the group consisting of SEQ ID NOS:5, 1 92 to 5,255. 

25. A polypeptide comprising at least one epitope encoded by a Staphylococcus aureus amino acid sequence selected 
from the group consisting of the epitopic sequences listed in Table 4. 

26. The polypeptide of claim 24 or 26, wherein said polypeptide is fixed to a solid phase. 

27. A diagnostic kit for detecting Staphylococcus aureus infection comprising 

(a) an isolated polypeptide antigen of claim 24, and 

(b) means for detecting the binding of an antibody contained in a biological fluid to said antigen 

28. A vaccine composition comprising a polypeptide of claim 24 present in a pharmaceutical^ acceptable carrier. 



29. A method of vaccinating an individual against Staphylococcus aureus infection comprising, administering to an 
individual the vaccine composition of claim 28, 
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Figure 1 
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